Abstract. Mild sufficient conditions are given for exponential ergodicity of a Markov process defined as the
solution to an SDE with a jump noise. These conditions include three principal claims: the recurrence
condition R, the topological irreducibility condition S and the non-degeneracy condition N, the latter formulated in
terms of a certain random subspace of $\mathbb{R}^m$ associated with the initial equation. Examples are given
showing that, in general, none of the three principal claims can be removed without losing ergodicity of the process.
The key point of the approach developed in the paper is that the local Doeblin condition can be derived from
N and S via the stratification method and a criterion for the convergence in variation of the family of induced
measures on $\mathbb{R}^m$.



Introduction 

In this paper, we study ergodic properties of a Markov process X in $\mathbb{R}^m$, given by an SDE

(0.1) $dX(t) = a(X(t))\,dt + \int_{\|u\|\le 1} c(X(t-),u)\,\tilde\nu(dt,du) + \int_{\|u\|>1} c(X(t-),u)\,\nu(dt,du).$

Here, $\nu$ is a Poisson point measure, $\tilde\nu$ is the corresponding compensated measure, and the coefficients a, c are supposed
to satisfy the usual conditions sufficient for the strong solution of (0.1) to exist and be unique. Our aim is to give
sufficient conditions for exponential ergodicity of (0.1) that impose restrictions on the Lévy measure
of the noise that are as weak as possible.

There exist two well-developed methods for treating the ergodicity problem for discrete-time Markov
processes valued in a locally compact phase space. The first one is based on the coupling technique (see the
detailed overview in [H]); the second one uses the notions of T-chain and petite sets (see [2T], [5] and references
therein). These methods can be naturally extended to the continuous-time case either by a
time discretization procedure (as was done for the solutions to SDE's with jumps in the recent paper [20]), or by a
straightforward use of the coupling technique in a continuous-time setting (see [27], [25] for this kind of
technique for diffusion processes). Typically, in the methods mentioned above, two principal features
should be provided:

- recurrence outside some large ball; 

- regularity of the transition probability in some bounded domain. 

The first feature can be provided in a quite standard way via an appropriate version of the Lyapunov
criterion (condition R in Theorem 1.1 below). The second one is more intrinsic, and requires some care
in the choice both of the concrete terms in which this feature is formulated and of the conditions on the
process sufficient for it to hold true. We deal with the form of the regularity feature that is usually
called the local Doeblin condition, and is formulated below.

LD. For every R > 0, there exists T = T(R) > 0 such that

$\inf_{\|x\|,\|y\|\le R}\int_{\mathbb{R}^m}\left[P_x^T\wedge P_y^T\right](dz) > 0,$

where $P_x^t(\cdot) = P(X(t)\in\cdot\,|\,X(0)=x)$, and, for any two probability measures $\mu,\varkappa$,

$[\mu\wedge\varkappa](dz) = \min\left[\frac{d\mu}{d(\mu+\varkappa)}(z),\ \frac{d\varkappa}{d(\mu+\varkappa)}(z)\right](\mu+\varkappa)(dz).$
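For measures with finitely many atoms, the quantity entering condition LD can be computed directly. The following minimal Python sketch is an illustration only and is not part of the paper's argument; the function names `min_measure_mass` and `tv_norm` and the two toy distributions are assumptions made for the example. It computes the total mass of the minimum of two discrete probability measures and checks it against the identity "mass of the minimum = 1 - (1/2) x total variation norm of the difference", which holds for any two probability measures.

```python
from fractions import Fraction

def min_measure_mass(mu, kappa):
    """Total mass of the minimum of two discrete measures given as dicts atom -> mass."""
    support = set(mu) | set(kappa)
    return sum(min(mu.get(z, 0), kappa.get(z, 0)) for z in support)

def tv_norm(mu, kappa):
    """Total variation norm of mu - kappa (sum of absolute differences of the masses)."""
    support = set(mu) | set(kappa)
    return sum(abs(mu.get(z, 0) - kappa.get(z, 0)) for z in support)

# Two hypothetical transition laws sharing only the atom at 1.
mu = {0: Fraction(1, 2), 1: Fraction(1, 2)}
kappa = {1: Fraction(1, 4), 2: Fraction(3, 4)}

overlap = min_measure_mass(mu, kappa)   # common mass of the two laws
distance = tv_norm(mu, kappa)           # total variation norm of the difference
```

In this toy setting, a strictly positive `overlap` is exactly what LD asks for, uniformly over the starting points in a ball.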

The non-trivial question is what is the proper form of the conditions on the coefficients a, c of equation
(0.1) and the Lévy measure of the noise, sufficient for the local Doeblin condition to hold true. In a diffusion
setting, standard strong ellipticity (or, more generally, Hörmander type) non-degeneracy conditions on the
coefficients provide that the transition probability of the process possesses a smooth density w.r.t. Lebesgue
measure, and thus LD holds true ([27], [25]). In the jump noise case, one can proceed analogously and claim
the transition probability of the solution to (0.1) to possess a locally bounded density (exactly this claim was
used as a basic assumption in the recent paper [20]). However, in the latter case such a claim appears
to be too restrictive; let us discuss this question in more detail. Consider, for simplicity, equation (0.1) with
c(x,u) = u, i.e. the following non-linear analogue of the Ornstein-Uhlenbeck equation:

(0.2) $dX(t) = a(X(t))\,dt + dU_t,$

where $U_t = \int_0^t\int_{\|u\|\le 1} u\,\tilde\nu(ds,du) + \int_0^t\int_{\|u\|>1} u\,\nu(ds,du)$ is a Lévy process. There exist two methods to
provide the process defined by (0.2) to possess a bounded (moreover, belonging to the class $C^\infty$) transition
probability density. The first one was initially proposed by J. Bismut (see [3], [2], [IH], [IS]), the second one
by J. Picard (see [H], [I3]). Both these methods require, among others, the following condition on the Lévy
measure $\Pi$ of the process U to hold true:

(0.3) $\exists\,\rho\in(0,2):\quad \varepsilon^{-\rho}\int_{\|u\|\le\varepsilon}\|u\|^2\,\Pi(du)\to\infty,\quad \varepsilon\to 0+.$

This limitation is not a formal one. It is known (see [18], Theorem 1.4) that if

(0.4) $\liminf_{\varepsilon\to 0+}\left[\varepsilon^2\ln\frac{1}{\varepsilon}\right]^{-1}\sup_{\|v\|=1}\int\left[|(u,v)|\wedge\varepsilon\right]^2\Pi(du) = 0,$

then the transition probability density, if it exists, does not belong to any $L_{p,loc}(\mathbb{R}^m)$, p > 1, and therefore is not
locally bounded (note that (0.4) implies that (0.3) fails). One can say that when the intensity of the jump
noise is "sparse near zero" in the sense of (0.4), the behavior of the density essentially differs from the diffusion
one, and the density either does not exist or is essentially irregular.

On the other hand, let us formulate a corollary of the general ergodicity result, given in Theorems 1.1, 1.3
below.

Proposition 0.1. Let m = 1, suppose that a(·) is locally Lipschitz on $\mathbb{R}$ and $\limsup_{|x|\to+\infty}\frac{a(x)}{x} < 0$. Suppose
that the Lévy measure $\Pi$ of the process U satisfies the following conditions:
(i) there exists q > 0: $\int_{|u|>1}|u|^q\,\Pi(du) < +\infty$;

(ii) $\Pi(\mathbb{R}\setminus\{0\}) \neq 0$.

Then the solution to (0.2) is exponentially ergodic, i.e. its invariant distribution $\mu_{inv}$ exists and is unique,
and, for some positive constant C,

(0.5) $\forall x\in\mathbb{R},\quad \|P_x^t - \mu_{inv}\|_{var} = O(\exp[-Ct]),\quad t\to+\infty.$

In this statement, the non-degeneracy condition (ii) on the jump noise is, obviously, the weakest possible
one: if it fails, then (0.2) is an ODE, and (0.5) fails also. We can conclude that the proper conditions on the
jump noise, sufficient to provide exponential ergodicity of the process defined by (0.2), are much milder than
the conditions that should be imposed in order to provide that this process possesses a regular (locally bounded
or even locally $L_p$-integrable) transition probability density.
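To make the setting of equation (0.2) concrete, here is a small Python sketch under explicitly assumed choices that do not come from the paper: the drift a(x) = -x, and a compound Poisson noise whose Lévy measure puts mass 1 on each of the points -1 and +1. With this linear drift the equation can be solved exactly between jumps. The sketch drives two solutions with different starting points by the same noise path; their distance is then exactly |x - y| exp(-t). This only illustrates how the drift makes the process forget its initial condition; it is not the total variation convergence (0.5), and it is not the method used in the paper.

```python
import math
import random

def simulate(x0, t_end, jump_times, jump_sizes):
    """Exact solution of dX = -X dt + dU along a given compound Poisson path U.

    Between jumps the (assumed) drift a(x) = -x gives exponential decay;
    at each jump time the jump size is added.
    """
    x, t = x0, 0.0
    for tau, u in zip(jump_times, jump_sizes):
        if tau > t_end:
            break
        x = x * math.exp(-(tau - t)) + u
        t = tau
    return x * math.exp(-(t_end - t))

def compound_poisson_path(rate, t_end, rng):
    """Jump times and sizes on [0, t_end]; sizes are -1 or +1 with equal probability."""
    times, sizes, t = [], [], 0.0
    while True:
        t += rng.expovariate(rate)
        if t > t_end:
            return times, sizes
        times.append(t)
        sizes.append(rng.choice([-1.0, 1.0]))

rng = random.Random(0)
times, sizes = compound_poisson_path(rate=2.0, t_end=5.0, rng=rng)
x_t = simulate(3.0, 5.0, times, sizes)    # solution started at x = 3
y_t = simulate(-1.0, 5.0, times, sizes)   # solution started at y = -1, same noise
gap = abs(x_t - y_t)
```

If the noise is removed (rate of jumps equal to zero), `simulate` returns the solution of the ODE, which is the degenerate situation excluded by condition (ii).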

Our way to prove the local Doeblin condition for the solution to (0.1) strongly relies on the finite-dimensional
criterion for the convergence in variation of the family of induced measures on $\mathbb{R}^m$. This criterion was
obtained in [1] (the case m = 1 was treated in [7]). Via the stratification method (for a detailed exposition of
this topic see [5], Section 2) this criterion can be extended to any probability space with a measurable group of
admissible transformations (an admissible family) that generates a measurable stratification of the probability
space (for details, see Section 2 below). The key point is that the criterion for the convergence in variation
of the family of induced measures is local in the following sense: such a convergence holds true as soon as
the initial probability measure is restricted to any set where the gradient (w.r.t. the given admissible family) of
the limiting functional is non-degenerate. We impose a condition (condition N in Theorem 1.3 below) that
implies the existence of an admissible family such that the solution to (0.1) possesses a gradient w.r.t. this family,
and this gradient is non-degenerate with positive probability. Under this condition, the (local) criterion for
convergence in variation provides the local Doeblin condition in a small ball (Lemma 3.1 below). Together
with topological irreducibility of the process (provided, in our setting, by condition S of Theorem 1.3), this
gives condition LD.

In our construction, we use time-stretching transformations of the Lévy jump noise. Such a choice is not the
only one possible; for instance, one can use groups of transformations varying the values of the jumps (see [5]), and
obtain another version of the non-degeneracy condition N. In order to keep the exposition compact, we do not
give the exact formulation and proof of the corresponding statement, although the general scheme is totally the
same. Let us just point out that the main advantage of the choice of the differential structure made in the present
article is that, for the Lévy jump noise, time-stretching transformations, unlike transformations of the phase
variable, are admissible without any regularity claim on the Lévy measure of the noise.

The structure of the paper is the following. In Section 1, we formulate the main statements of the paper. In 
Section 2, we give necessary background from the stochastic calculus involving time-stretching transformations 
of the Levy jump noise, and its applications to the convergence in variation of the distributions of the solutions 
to SDE's with such noise. In Section 3, we prove the main statements of the paper. Sufficient conditions for 
the basic conditions R,N,S from Section 2, easy to deal with, are given in Section 4. In Section 5, we 
give counterexamples showing that, in general, none of the basic conditions can be removed without losing 
ergodicity of the process. 



1. THE MAIN RESULTS

Let us introduce notation. Everywhere below $\nu$ is a Poisson point measure on $\mathbb{R}^+\times\mathbb{R}^d$, $\Pi$ is its Lévy measure
and $\tilde\nu(dt,du) = \nu(dt,du) - dt\,\Pi(du)$ is the corresponding compensated point measure. We denote by p(·) the
point process associated with $\nu$ and by $\mathcal{D}$ the domain of p(·). Later on, we will impose such conditions on
the coefficients a, c of equation (0.1) that this equation, endowed with the initial condition $X(0) = x\in\mathbb{R}^m$,
has a unique strong solution $\{X(t) = X(x,t), t\ge 0\}$, which is a process with càdlàg trajectories. We denote
by $P_\mu$ the distribution in $\mathbb{D}(\mathbb{R}^+,\mathbb{R}^m)$ of the solution X(·) to (0.1) with $\mathrm{Law}(X(0)) = \mu$, by $E_\mu$ the expectation
w.r.t. $P_\mu$, and by $P_\mu^t$ the distribution of X(t) w.r.t. $P_\mu$. In particular, we denote $P_x = P_{\delta_x}$, $P_x^t = P_{\delta_x}^t$, $x\in\mathbb{R}^m$.

All the functions used below are supposed to be measurable (jointly measurable w.r.t. (x,u), if necessary).
The gradient w.r.t. the variable z is denoted by $\nabla_z$. The unit matrix in $\mathbb{R}^m$ is denoted by $I_{\mathbb{R}^m}$. We use the
same notation $\|\cdot\|$ for the Euclidean norms both in $\mathbb{R}^m$ and $\mathbb{R}^d$, and for appropriate matrix norms. The
open ball in $\mathbb{R}^m$ with center x and radius R is denoted by $B_{\mathbb{R}^m}(x,R)$. The space of probability measures
on $\mathbb{R}^m$ is denoted by $\mathcal{P}$, the total variation norm is denoted by $\|\cdot\|_{var}$. The (closed) support of the measure
$\mu\in\mathcal{P}$ is denoted by supp $\mu$. For $\mu\in\mathcal{P}$ and a non-negative measurable function $\varphi:\mathbb{R}^m\to\mathbb{R}$, we denote

$\varphi(\mu) = \int_{\mathbb{R}^m}\varphi(x)\,\mu(dx).$

The coefficient a is supposed to belong to the class $C^1(\mathbb{R}^m,\mathbb{R}^m)$ and to satisfy the linear growth condition.
In our considerations, we will deal with the following two types of SDE's with a jump noise.

A. Moderate non-additive noise. The SDE of the type (0.1) with the jump coefficient c dependent on the
space variable x. We claim the following standard conditions to hold true:

(1.1) $\|c(x,u)-c(y,u)\|\le K(1+\|u\|)\|x-y\|,\quad \|c(x,u)\|\le\psi_*(x)\|u\|,\quad u\in\mathbb{R}^d,\ x,y\in\mathbb{R}^m,$
with some constant $K\in\mathbb{R}^+$ and some function $\psi_*:\mathbb{R}^d\to\mathbb{R}^+$ satisfying the linear growth condition. We also
claim the following specific moment condition:

(1.2) $\int_{\mathbb{R}^d}\sup_{\|x\|\le R}\left[\|c(x,u)\| + \|\nabla_x c(x,u)\|_{\mathbb{R}^{m^2}}\right]\Pi(du) < +\infty,\quad R\in\mathbb{R}^+$

(the gradient $\nabla_x c(x,u)$ is supposed to exist and to be continuous w.r.t. x). We interpret this condition in the
sense that the jump part of the equation is moderate.

B. Arbitrary additive noise. The SDE of the type (0.1) with the jump coefficient c that does not depend
on the space variable x: c(x,u) = c(u) and $\|c(u)\|\le\ldots$ No moment conditions like (1.2) are imposed on the

jump part. In this case, (0.1) is a non-linear analogue (0.2) of the Ornstein-Uhlenbeck equation. Making a
change $\nu(\cdot)\mapsto\nu^c(\cdot)$, $\nu^c([0,t]\times A) = \nu([0,t]\times c^{-1}(A))$, one can reduce (0.2) to the same equation with m = d
and c(u) = u. In order to simplify notation, in the sequel we consider such equations only.

In both cases given above, equation (0.1), endowed with the initial condition X(0) = x, has a unique
strong solution, which is a Feller Markov process with càdlàg trajectories, and $\{P_x^t(\cdot), t\in\mathbb{R}^+, x\in\mathbb{R}^m\}$ is its
transition probability. In the case A, the trajectories of this solution a.s. have bounded variation on every
finite interval.

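Denote, as in [20],

$Q = \{f\in C^2(\mathbb{R}^m,\mathbb{R}) \mid \exists\,\tilde f \text{ locally bounded such that } \int_{\|u\|>1} f(x+c(x,u))\,\Pi(du)\le\tilde f(x),\ x\in\mathbb{R}^m\},$

and, for $f\in Q$, write

$\mathcal{A}f(x) = \int_{\mathbb{R}^d}\left[f(x+c(x,u)) - f(x) - (\nabla f(x),c(x,u))_{\mathbb{R}^m}\mathbf{1}_{\|u\|\le 1}\right]\Pi(du),\quad x\in\mathbb{R}^m.$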

Let us formulate two general statements concerning the rate of convergence to the ergodic distribution of X
and estimates for the β-mixing coefficients of X.

Theorem 1.1. Suppose that the LD condition holds true together with the following recurrence condition:
R. There exist a function $\varphi\in Q$ and constants $\alpha,\gamma>0$ such that

$\mathcal{A}\varphi\le-\alpha\varphi+\gamma$ and $\varphi(x)\to+\infty,\ \|x\|\to+\infty.$

Then the process X possesses a unique invariant distribution $\mu_{inv}\in\mathcal{P}$, and there exist positive constants
$C_1, C_2$ such that, for every $\mu\in\mathcal{P}$ with $\varphi(\mu)<+\infty$,

(1.3) $\|P_\mu^t-\mu_{inv}\|_{var}\le C_1[\varphi(\mu)+1]\exp[-C_2t],\quad t\in\mathbb{R}^+.$
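Recall that the β-mixing coefficient for X is defined by

$\beta_\mu(t) = \sup_{s\in\mathbb{R}^+} E_\mu\left\|P_\mu(\cdot\,|\,\mathcal{F}_0^s) - P_\mu(\cdot)\right\|_{var,\mathcal{F}_{t+s}^\infty},\quad t\in\mathbb{R}^+,$

where $\mathcal{F}_a^b = \sigma(X(s), s\in[a,b])$, $P_\mu(\cdot\,|\,\mathcal{F}_0^s)$ denotes the conditional distribution of $P_\mu$ w.r.t. $\mathcal{F}_0^s$, and

$\|\varkappa\|_{var,\mathcal{G}} = \sup_{B_1\cap B_2=\emptyset,\ B_1\cup B_2=\mathbb{R}^m,\ B_1,B_2\in\mathcal{G}}[\varkappa(B_1)-\varkappa(B_2)].$

If $\mu=\mu_{inv}$, then $\beta(\cdot)=\beta_{\mu_{inv}}(\cdot)$ is the mixing coefficient of the stationary version of the process X.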

Theorem 1.2. Suppose the conditions of Theorem 1.1 hold true. Then

(i) for every $\mu\in\mathcal{P}$ with $\varphi(\mu)<+\infty$

(1.4) $\beta_\mu(t)\le C_1[\varphi(\mu)+1]\exp[-C_2t];$

(ii) $\varphi(\mu_{inv})<+\infty$, and thus the mixing coefficient $\beta(\cdot)$ allows the exponential estimate (1.4).
In order to shorten the exposition, we do not formulate here typical applications of the estimates of the type
(1.4), such as the Central Limit Theorem, referring the reader to the literature (see, for instance, Theorem 4 [28]).

The following theorem, that gives sufficient conditions for condition LD to hold true, is the main result of 
the present paper. We need some additional notation. 

In the case A, put $\tilde a(\cdot) = a(\cdot) - \int_{\|u\|\le1}c(\cdot,u)\,\Pi(du)$, and denote

$\Delta(x,u) = [\tilde a(x+c(x,u)) - \tilde a(x)] - [\nabla_xc(x,u)]\,\tilde a(x).$

In the case B, denote $\Delta(x,u) = [a(x+u) - a(x)]$ (note that if in the case B condition (1.2) holds true, then
these two formulas define the same function). Denote by $\{\mathcal{E}_s^t, 0\le s\le t\}$ the solution to the linear SDE in $\mathbb{R}^{m\times m}$

$\mathcal{E}_s^t = I_{\mathbb{R}^m} + \int_s^t\nabla a(X(r))\mathcal{E}_s^r\,dr + \int_s^t\int_{\|u\|\le1}\nabla_xc(X(r-),u)\mathcal{E}_s^{r-}\,\tilde\nu(dr,du) + \int_s^t\int_{\|u\|>1}\nabla_xc(X(r-),u)\mathcal{E}_s^{r-}\,\nu(dr,du)$

($\mathcal{E}_s^t$ is supposed to be right continuous w.r.t. both time variables s, t), and define the random linear space $S_t$
as the span (in $\mathbb{R}^m$) of the set

$\{\mathcal{E}_\tau^t\Delta(X(\tau-),p(\tau)),\ \tau\in\mathcal{D}\cap(0,t)\}.$

Theorem 1.3. Let the following two conditions hold true.
N. There exist $x_*\in\mathbb{R}^m$, $t_*>0$ such that

$P_{x_*}(S_{t_*}=\mathbb{R}^m) > 0.$
S. For any R > 0 there exists t = t(R) such that

$x_*\in\mathrm{supp}\,P_x^t,\quad \|x\|\le R.$

Then condition LD holds true.
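Condition N asks that finitely many random vectors span the whole space. For one fixed realization this is a plain linear-algebra check, which the following Python sketch illustrates; it is an illustration only, the helper names `rank` and `spans_whole_space` are assumptions of this example, and the input vectors stand in for the vectors that generate the random space in condition N.

```python
from fractions import Fraction

def rank(vectors):
    """Rank of a list of vectors, computed by exact Gaussian elimination."""
    rows = [[Fraction(v) for v in vec] for vec in vectors]
    ncols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def spans_whole_space(vectors, m):
    """True if the given vectors span the m-dimensional space."""
    return rank(vectors) == m

two_independent_jumps = [[1, 0], [1, 1]]
two_collinear_jumps = [[1, 2], [2, 4]]
```

In particular, in dimension m at least m jumps with linearly independent contributions are needed before time $t_*$; a single jump can suffice only when m = 1.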

In Section 4 below, we give some sufficient conditions for R, N, S to hold true, formulated in terms of
the coefficients of equation (0.1) and the Lévy measure of the jump noise.

2. TIME-STRETCHING TRANSFORMATIONS AND CONVERGENCE IN VARIATION OF INDUCED MEASURES

In this section, we give the necessary background for the technique involving time-stretching transformations of
the Lévy point measure, with its applications to the problem of convergence in variation of the distributions
of the solutions to SDE's with jumps. This technique is our tool in the proof of Theorem 1.3. Some of the
statements we give without proofs, referring to the recent papers [18], [17].

Denote $H = L_2(\mathbb{R}^+)$, $H_0 = L_\infty(\mathbb{R}^+)\cap L_2(\mathbb{R}^+)$, $Jh(\cdot) = \int_0^\cdot h(s)\,ds$, $h\in H$. For a fixed $h\in H_0$, we define the
family $\{T_h^t, t\in\mathbb{R}\}$ of transformations of the axis $\mathbb{R}^+$ by putting $T_h^tx$, $x\in\mathbb{R}^+$, equal to the value at the point
s = t of the solution to the Cauchy problem

(2.1) $z'_{x,h}(s) = Jh(z_{x,h}(s)),\quad s\in\mathbb{R},\qquad z_{x,h}(0) = x.$

Denote $T_h = T_h^1$, then $T_{sh}\circ T_{th} = T_{(s+t)h}$ ([18]). This means that $T_h\equiv\{T_{th}, t\in\mathbb{R}\}$ is a one-dimensional
group of transformations of the time axis $\mathbb{R}^+$. It follows from the construction that $\frac{d}{dt}\big|_{t=0}T_{th}x = Jh(x)$, $x\in\mathbb{R}^+$.
We call $T_h$ the time-stretching transformation because, for $h\in C(\mathbb{R}^+)\cap H_0$, it can be informally described
in the following way: every infinitesimal segment dx of the time axis should be stretched by $e^{h(x)}$ times, and
then all the stretched segments should be glued together, preserving the initial order of the segments ([15]).
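The Cauchy problem (2.1) can be explored numerically. The Python sketch below is an illustration under assumptions that are not taken from the paper: h is chosen as the indicator of [0, 1), so that Jh(z) = min(z, 1) for z ≥ 0, and the flow is approximated by a classical Runge-Kutta scheme (the function name `stretch` and the step count are choices of this example). Since rescaling h by t rescales the right-hand side of (2.1), the transformation for the function th is the time-t flow of z' = Jh(z), and the group property is the flow property.

```python
import math

def J_h(z):
    """Jh(z) = integral of h over [0, z] for the assumed h = 1 on [0, 1), 0 elsewhere."""
    return min(max(z, 0.0), 1.0)

def stretch(x, t, steps=2000):
    """Approximate the image of x under the transformation for t*h:
    the time-t flow of z' = Jh(z), z(0) = x, by the classical RK4 scheme."""
    dt = t / steps
    z = x
    for _ in range(steps):
        k1 = J_h(z)
        k2 = J_h(z + 0.5 * dt * k1)
        k3 = J_h(z + 0.5 * dt * k2)
        k4 = J_h(z + dt * k3)
        z += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return z

short = stretch(0.5, 0.3)                  # stays inside [0, 1): equals 0.5 * e^{0.3}
direct = stretch(0.5, 1.1)                 # one transformation with parameter 1.1
composed = stretch(stretch(0.5, 0.4), 0.7) # two transformations, parameters 0.4 and 0.7
```

Inside [0, 1), where h = 1, every small segment is stretched by the factor $e^{t}$, in agreement with the informal description; beyond 1, where h = 0, segments are only shifted.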

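Denote $\Pi_{fin} = \{\Gamma\in\mathcal{B}(\mathbb{R}^d),\ \Pi(\Gamma)<+\infty\}$ and define, for $h\in H_0$, $\Gamma\in\Pi_{fin}$, a transformation $T_h^\Gamma$ of the
random measure $\nu$ by

$[T_h^\Gamma\nu]([0,t]\times A) = \nu([0,T_ht]\times(A\cap\Gamma)) + \nu([0,t]\times(A\setminus\Gamma)),\quad t\in\mathbb{R}^+,\ A\in\Pi_{fin}.$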

Further we use the standard terminology from the theory of Poisson point measures without any additional
discussion. The term "(locally finite) configuration" for a realization of the point measure is frequently used.
We suppose that the basic probability space $(\Omega,\mathcal{F},P)$ satisfies the condition $\mathcal{F}=\sigma(\nu)$, i.e. every random variable
is a functional of $\nu$. This means that in fact one can treat $\Omega$ as the configuration space over $\mathbb{R}^+\times(\mathbb{R}^d\setminus\{0\})$
with a respective σ-algebra. The image of a configuration of the point measure $\nu$ under $T_h^\Gamma$ can be described
in the following way: every point $(\tau,x)$ with $x\notin\Gamma$ remains unchanged; for every point $(\tau,x)$ with $x\in\Gamma$, its
"moment of the jump" $\tau$ is transformed to $T_{-h}\tau$; no point of the configuration is eliminated and no
new point is added to the configuration.
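The action on a configuration described above can be sketched in a few lines of Python. Everything specific here is an assumption made for illustration: h is again the indicator of [0, 1), for which the inverse time change has the closed form coded in `t_minus_h`; the set of jump values whose times are moved is taken to be {|u| ≥ 1}; and the five-point configuration is made up.

```python
import math

def t_minus_h(tau):
    """Inverse time change for the assumed h = 1 on [0, 1), 0 elsewhere:
    the time-1 flow of z' = -min(z, 1) started at tau."""
    if tau <= 1.0:
        return tau * math.exp(-1.0)
    if tau >= 2.0:
        return tau - 1.0
    return math.exp(-(2.0 - tau))

def transform_configuration(points, in_gamma):
    """Move the jump time of every point (tau, u) with u in the chosen set;
    leave all other points unchanged. No point is removed or added."""
    return [(t_minus_h(tau), u) if in_gamma(u) else (tau, u) for tau, u in points]

# A hypothetical configuration of (jump time, jump value) pairs.
config = [(0.5, 0.1), (0.8, 2.0), (1.5, -3.0), (2.5, 0.2), (3.0, 1.0)]
image = transform_configuration(config, in_gamma=lambda u: abs(u) >= 1.0)
moved_times = [tau for tau, u in image if abs(u) >= 1.0]
```

The time change is increasing, so the moved points keep their relative order, while the points with small jump values stay where they were.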

For $h\in H_0$, $\Gamma\in\Pi_{fin}$ the transformation $T_h^\Gamma$ is admissible for $\nu$, i.e. the distributions of the point measures $\nu$
and $T_h^\Gamma\nu$ are equivalent ([18]). This implies that (recall that $\mathcal{F}=\sigma(\nu)$) the transformation $T_h^\Gamma$ generates the
corresponding transformation of the random variables; we denote it also by $T_h^\Gamma$.

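Define $\mathcal{C}$ as the set of functionals $f\in\cap_pL_p(\Omega,P)$ satisfying the following condition: for every $\Gamma\in\Pi_{fin}$,
there exists a random element $\nabla_H^\Gamma f\in\cap_pL_p(\Omega,P,H)$ such that, for every $h\in H_0$,

(2.2) $(\nabla_H^\Gamma f,h)_H = \lim_{\varepsilon\to0}\frac{1}{\varepsilon}\left[T_{\varepsilon h}^\Gamma\circ f - f\right]$

with convergence in every $L_p$, $p<+\infty$.
Denote

$(\rho^\Gamma,h) = -\int_0^\infty h(t)\,\tilde\nu(dt,\Gamma),\quad h\in H_0,\ \Gamma\in\Pi_{fin}.$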

Lemma 2.1. ([18], Lemma 3.2). For every $\Gamma\in\Pi_{fin}$, the pair $(\nabla_H^\Gamma,\mathcal{C})$ satisfies the following conditions:

1) For every $f_1,\ldots,f_n\in\mathcal{C}$ and $F\in C_b^1(\mathbb{R}^n)$,

$F(f_1,\ldots,f_n)\in\mathcal{C}$ and $\nabla_H^\Gamma F(f_1,\ldots,f_n) = \sum_{k=1}^nF'_k(f_1,\ldots,f_n)\nabla_H^\Gamma f_k$

(chain rule).

2) The map $\rho^\Gamma: h\mapsto(\rho^\Gamma,h)$ is a weak random element in H with weak moments of all orders, and

$E(\nabla_H^\Gamma f,h)_H = -Ef(\rho^\Gamma,h),\quad h\in H,\ f\in\mathcal{C}$

(integration-by-parts formula).

3) There exists a countable set $\mathcal{C}_0\subset\mathcal{C}$ such that $\sigma(\mathcal{C}_0)=\mathcal{F}$.

The construction described before gives us the family $\mathcal{T} = \{T_h^\Gamma, h\in H_0, \Gamma\in\Pi_{fin}\}$ of admissible
transformations of the probability space $(\Omega,\mathcal{F},P)$, such that the probability P is logarithmically differentiable
w.r.t. every $T_h^\Gamma$ with the corresponding logarithmic derivative equal to $(\rho^\Gamma,h)$. This allows us to introduce
a derivative w.r.t. such a family, that is an analogue of the Malliavin derivative on the Wiener space or
the Sobolev derivative on a finite-dimensional space. However, the structure of the family $\mathcal{T}$ differs from
the structure of the family of linear shifts: for instance, there exist $h,g\in H_0$, $\Gamma\in\Pi_{fin}$ such that
$T_h^\Gamma\circ T_g^\Gamma\neq T_g^\Gamma\circ T_h^\Gamma$. This feature motivates the following "refinement", introduced in [18], of the construction
described before.

Definition 2.2. A family $\mathcal{G} = \{[a_i,b_i)\subset\mathbb{R}^+, h_i\in H_0, \Gamma_i\in\Pi_{fin}, i\in\mathbb{N}\}$ is called a differential grid (or simply
a grid) if

(i) for every $i\neq j$, $\left([a_i,b_i)\times\Gamma_i\right)\cap\left([a_j,b_j)\times\Gamma_j\right)=\emptyset$;

(ii) for every $i\in\mathbb{N}$, $Jh_i>0$ inside $(a_i,b_i)$ and $Jh_i=0$ outside $(a_i,b_i)$.

Any grid $\mathcal{G}$ generates a partition of some part of the phase space $\mathbb{R}^+\times(\mathbb{R}^d\setminus\{0\})$ of the random measure
$\nu$ into the cells $\{\mathcal{G}_i = [a_i,b_i)\times\Gamma_i\}$. We call the grid $\mathcal{G}$ finite if $\mathcal{G}_i=\emptyset$ for all indices $i\in\mathbb{N}$ except some
finite number of them. Although while studying some other problems (such as smoothness of the transition
probability density for X, see [18]) we typically use infinite grids, in our current exposition we can restrict
ourselves to finite grids with the number of non-empty cells equal to m (recall that m is the dimension of
the phase space for equation (0.1)). Thus, everywhere below we simplify the notation from [18] and consider
the grids $\mathcal{G}$ with index i varying from 1 to m.
Denote $T_s^i = T_{sh_i}^{\Gamma_i}$. For any $i\le m$, $s,\tilde s\in\mathbb{R}$, the transformations $T_s^i, T_{\tilde s}^i$ commute because so do the time axis
transformations $T_{sh_i}, T_{\tilde sh_i}$. The transformation $T_s^i$ does not change points of the configuration outside the cell $\mathcal{G}_i$ and
keeps the points from this cell in it. Therefore, for every $i,\tilde i\le m$, $s,\tilde s\in\mathbb{R}$, the transformations $T_s^i, T_{\tilde s}^{\tilde i}$ commute,
which implies the following proposition ([18]).



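Proposition 2.3. For a given grid $\mathcal{G}$ and $t = (t_1,\ldots,t_m)\in\mathbb{R}^m$, define the transformation

$T_t^{\mathcal{G}} = T_{t_1}^1\circ T_{t_2}^2\circ\cdots\circ T_{t_m}^m.$

Then $\mathcal{T}^{\mathcal{G}} = \{T_t^{\mathcal{G}}, t\in\mathbb{R}^m\}$ is the group of admissible transformations of $\Omega$ which is additive in the sense that

$T_{t^1+t^2}^{\mathcal{G}} = T_{t^1}^{\mathcal{G}}\circ T_{t^2}^{\mathcal{G}},\quad t^1,t^2\in\mathbb{R}^m.$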

It can be said that, by fixing the grid $\mathcal{G}$, we choose from the whole family of admissible transformations
$\{T_h^\Gamma, h\in H_0, \Gamma\in\Pi_{fin}\}$ the additive sub-family that is more convenient to deal with. The following lemma
describes the differential properties of the solution to (0.1) w.r.t. this family. We denote by X(x,·) the strong
solution to (0.1) with X(0) = x.

Lemma 2.4. I. In the case B, for every $t\in\mathbb{R}^+$, $\Gamma\in\Pi_{fin}$, every component $X_k(t)$, k = 1,...,m, of the vector
X(t) belongs to the class $\mathcal{C}$. For every $h\in H_0$, $x\in\mathbb{R}^m$, the process

$Y^{h,\Gamma}(x,t) = \left((\nabla_H^\Gamma X_1(x,t),h)_H,\ldots,(\nabla_H^\Gamma X_m(x,t),h)_H\right)^T,\quad x\in\mathbb{R}^m,\ t\in\mathbb{R}^+$

satisfies the equation

(2.3) $Y^{h,\Gamma}(x,t) = \int_0^t\int_\Gamma\Delta(X(x,s-),u)\,Jh(s)\,\nu(ds,du) + \int_0^t[\nabla a](X(x,s))\,Y^{h,\Gamma}(x,s)\,ds,\quad t\ge0.$

II. In the case A, for every $x\in\mathbb{R}^m$, $t\in\mathbb{R}^+$, $\Gamma\in\Pi_{fin}$, $h\in H_0$, every component of the vector X(x,t) is a.s.
differentiable w.r.t. $\{T_{rh}^\Gamma, r\in\mathbb{R}\}$, i.e., there exist a.s. limits

(2.4) $Y_k^{h,\Gamma}(x,t) = \lim_{\varepsilon\to0}\frac{1}{\varepsilon}\left[T_{\varepsilon h}^\Gamma X_k(x,t) - X_k(x,t)\right],\quad k=1,\ldots,m.$

The process $Y^{h,\Gamma}(x,t) = (Y_1^{h,\Gamma}(x,t),\ldots,Y_m^{h,\Gamma}(x,t))^T$ satisfies the equation

(2.5) $Y^{h,\Gamma}(x,t) = \iint_{[0,t]\times\Gamma}\Delta(X(x,s-),u)\,Jh(s)\,\nu(ds,du) + \int_0^t[\nabla\tilde a](X(x,s))\,Y^{h,\Gamma}(x,s)\,ds + \iint_{[0,t]\times\mathbb{R}^d}[\nabla_xc](X(x,s-),u)\,Y^{h,\Gamma}(x,s-)\,\nu(ds,du),\quad t\ge0.$

Statement I is proved in [18], Theorem 4.1; statement II is proved in [16], Lemma 4.1.
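Remark. Solutions to equations (2.3), (2.5) can be given explicitly:

(2.6) $Y^{h,\Gamma}(x,t) = \iint_{[0,t]\times\Gamma}Jh(s)\,\mathcal{E}_s^t\,\Delta(X(x,s-),u)\,\nu(ds,du) = \sum_{\tau\in\mathcal{D}\cap[0,t],\ p(\tau)\in\Gamma}Jh(\tau)\,\mathcal{E}_\tau^t\,\Delta(X(x,\tau-),p(\tau)).$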

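For a given grid $\mathcal{G} = \{[a_i,b_i)\subset\mathbb{R}^+, h_i\in H_0, \Gamma_i\in\Pi_{fin}, i\le m\}$, denote $Y^{\mathcal{G},i} = Y^{h_i,\Gamma_i}$, i = 1,...,m, and
consider the matrix-valued process

$Y^{\mathcal{G}}(x,t) = \left(Y_k^{\mathcal{G},i}(x,t)\right)_{i,k=1}^m,\quad t\in\mathbb{R}^+.$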

The following lemma is the key point in our approach. The statement of the lemma is formulated for the cases 
A and B simultaneously. 

Lemma 2.5. Let $x\in\mathbb{R}^m$, t > 0 be fixed, denote $\Omega_{x,t} = \{\det Y^{\mathcal{G}}(x,t)\neq0\}$. Then
(2.7) $P|_{\Omega_{x,t}}\circ[X(y,t)]^{-1}\xrightarrow{var}P|_{\Omega_{x,t}}\circ[X(x,t)]^{-1},\quad y\to x.$

The proof is based on the following criterion for convergence in variation of induced measures on a
finite-dimensional space, obtained in [1].
Theorem 2.6. Let $F, F_n:\mathbb{R}^m\to\mathbb{R}^m$ be measurable functions that have approximative derivatives
$\nabla F_n$, $\nabla F$ a.s. w.r.t. Lebesgue measure $\lambda^m$, and let $E\in\mathcal{B}(\mathbb{R}^m)$ have finite Lebesgue measure. Suppose that $F_n\to F$
and $\nabla F_n\to\nabla F$ in the sense of convergence in measure $\lambda^m$ and $\det\nabla F\neq0$ a.s. on E. Then the following
statements are equivalent.

(i) for every measurable $A\subset E$, $\lambda^m|_A\circ F_n^{-1}\xrightarrow{var}\lambda^m|_A\circ F^{-1}$, $n\to+\infty$;

(ii) for every measurable $A\subset E$ and every $\delta>0$ there exists a compact set $K_\delta\subset A$ such that $\lambda^m(A\setminus K_\delta)\le\delta$
and $\lim_{n\to+\infty}\lambda^m(F_n(K_\delta)) = \lambda^m(F(K_\delta))$.

In the situation described in the preamble of Theorem 2.6, both (i) and (ii) can fail (see, for instance,
Example 1.2 [17]). Thus, in order to provide (i) (that is our goal), we should impose some additional conditions
on the sequence $\{F_n\}$, sufficient for (ii) to hold true. The following two sufficient conditions were proved in
[1], Corollaries 2.5 and 2.7, and in [17], Theorem 3.1, correspondingly.

Proposition 2.7. I. Let $F, F_n\in W^1_{p,loc}(\mathbb{R}^m,\mathbb{R}^m)$ with p > m ($W^1_{p,loc}$ denotes the local Sobolev space), and
$F_n\to F$, $n\to\infty$, w.r.t. the Sobolev norm $\|\cdot\|_{W_p^1(\mathbb{R}^m,\mathbb{R}^m)}$ on every ball. Then

(2.8) $\lambda^m|_A\circ F_n^{-1}\xrightarrow{var}\lambda^m|_A\circ F^{-1}$, $n\to+\infty$, for every measurable $A\subset\{\det\nabla F\neq0\}$.

II. Let, in the situation described in the preamble of Theorem 2.6, the sequence $\{F_n\}$ be uniformly
approximatively Lipschitz. This, by definition, means that for every $\delta>0$, $R<+\infty$ there exist a compact
set $K_{\delta,R}$ and a constant $L_{\delta,R}<+\infty$ such that $\lambda^m(B_{\mathbb{R}^m}(0,R)\setminus K_{\delta,R})<\delta$ and every function $F_n|_{K_{\delta,R}}$ is a Lipschitz
function with the Lipschitz constant $L_{\delta,R}$. Then (2.8) holds true.

Proof of Lemma 2.5. Take a sequence $y_n\to x$, $n\to+\infty$, and denote f = X(x,t), $f_n = X(y_n,t)$.
It is proved in [IS] (proof of Theorem 3.1) that the group $\mathcal{T}^{\mathcal{G}} = \{T_s^{\mathcal{G}}, s\in\mathbb{R}^m\}$ generates a measurable
parametrization of $(\Omega,\mathcal{F},P)$, i.e. there exists a measurable map $\Phi:\Omega\to\mathbb{R}^m\times\tilde\Omega$ such that $\tilde\Omega$ is a Borel
measurable space and the image of every orbit of the group $\mathcal{T}^{\mathcal{G}}$ under $\Phi$ has the form $L\times\{\varpi\}$, where $\varpi\in\tilde\Omega$
and L is a linear subspace of $\mathbb{R}^m$. The linear subspace L differs from $\mathbb{R}^m$ exactly in the case when the orbit
$\mathcal{T}^{\mathcal{G}}\{\omega\}$ is built for such an $\omega$ that, for some i = 1,...,m, $\nu((a_i,b_i)\times\Gamma_i)=0$ (i.e., some $T^i$ does not change
$\omega$). For every $\omega$ of such a type $\det Y^{\mathcal{G}}(x,t)=0$, and thus we need to investigate the laws of f, $f_n$, restricted
to $\Omega_{\mathcal{G}} = \{\nu((a_i,b_i)\times\Gamma_i)>0,\ i=1,\ldots,m\}$, only.

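The measure $P|_{\Omega_{\mathcal{G}}}$ can be decomposed into a regular family of conditional distributions such that every
conditional distribution is supported by an orbit of the group $\mathcal{T}^{\mathcal{G}}$ (see, for instance, [23]). Therefore we can
write

(2.9) $P(A) = \int_{\tilde\Omega}P_\varpi([A]_\varpi)\,\pi(d\varpi),\quad A\in\mathcal{F}\cap\Omega_{\mathcal{G}},$

where $[A]_\varpi = \{s\in\mathbb{R}^m\,|\,(s,\varpi)\in A\}$, $\pi$ is the image of $P|_{\Omega_{\mathcal{G}}}$ under the natural projection $\Omega\to\tilde\Omega$ and $P_\cdot(\cdot)$
is a probabilistic kernel, i.e. $P_\cdot(A)$ is a measurable function for every $A\in\mathcal{B}(\mathbb{R}^m)$ and $P_\varpi(\cdot)$ is a probability
measure on $\mathbb{R}^m$ for every $\varpi\in\tilde\Omega$. Any functional g on $\Omega_{\mathcal{G}}$ now can be considered as a functional on $\mathbb{R}^m\times\tilde\Omega$,
$g = \{g(s,\varpi),\ s\in\mathbb{R}^m,\ \varpi\in\tilde\Omega\}$. Below, we denote $[g]_\varpi(\cdot) = g(\cdot,\varpi):\mathbb{R}^m\to\mathbb{R}^m$, $\varpi\in\tilde\Omega$. One can write, for
$A\subset\Omega_{\mathcal{G}}$, that

$(P|_A\circ g^{-1})(\cdot) = \int_{\tilde\Omega}\left[P_\varpi|_{[A]_\varpi}\circ[g]_\varpi^{-1}\right](\cdot)\,\pi(d\varpi).$
Therefore, in order to prove the statement of Lemma 2.5 it is sufficient to prove that

(2.10) $P_\varpi|_{[\Omega_{x,t}]_\varpi}\circ[f_n]_\varpi^{-1}\xrightarrow{var}P_\varpi|_{[\Omega_{x,t}]_\varpi}\circ[f]_\varpi^{-1}$ for $\pi$-almost all $\varpi\in\tilde\Omega$.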

Denote $\rho_i = (\rho^{\Gamma_i},h_i)$, i = 1,...,m, and let $\rho^{\mathcal{G}}$ be the $\mathbb{R}^m$-valued function such that $(\rho^{\mathcal{G}},t) = \sum_{i=1}^mt_i\rho_i$, $t\in\mathbb{R}^m$.
Then one can show that, for $\pi$-almost all $\varpi\in\tilde\Omega$, the measure $P_\varpi$ possesses the logarithmic derivative equal to
$[\rho^{\mathcal{G}}]_\varpi$ (we do not give the detailed exposition here, since this fact is quite analogous to the one for logarithmically
differentiable measures on linear spaces, see [5]). One can deduce from the explicit formula for $\rho^{\mathcal{G}}$ that there
exists c > 0 such that $E\exp[(\rho^{\mathcal{G}},t)]<+\infty$, $\|t\|\le c$, and, therefore, that for $\pi$-almost all $\varpi\in\tilde\Omega$

(2.11) $\int_{\mathbb{R}^m}\exp\left[([\rho^{\mathcal{G}}]_\varpi(s),t)\right]P_\varpi(ds)<+\infty,\quad \|t\|\le c.$

Due to Proposition 4.3.1 0J, for every $\varpi$ such that (2.11) holds true, the measure $P_\varpi$ has the form $P_\varpi(dx) =
p_\varpi(x)\lambda^m(dx)$, where the function $p_\varpi$ is continuous and positive. Therefore, in order to prove (2.10), it is
enough to prove that, for $\pi$-almost all such $\varpi$ and every R > 0,

(2.12) $\lambda^m|_{B_{\mathbb{R}^m}(0,R)\cap[\Omega_{x,t}]_\varpi}\circ[f_n]_\varpi^{-1}\xrightarrow{var}\lambda^m|_{B_{\mathbb{R}^m}(0,R)\cap[\Omega_{x,t}]_\varpi}\circ[f]_\varpi^{-1}$ for $\pi$-almost all $\varpi\in\tilde\Omega$.

In order to prove (2.12), let us introduce auxiliary notions and give their relations with the notions of
Sobolev and approximative derivatives.

Definition 2.8. For a given function $F:\mathbb{R}^m\to\mathbb{R}^m$ and a σ-finite measure $\varkappa$ on $\mathcal{B}(\mathbb{R}^m)$ we say that F is
direction-wise $\varkappa$-a.s. differentiable if there exists a function $\nabla F:\mathbb{R}^m\to\mathbb{R}^{m\times m}$ such that, for every $t\in\mathbb{R}^m$,
$\frac{1}{\varepsilon}[F(\cdot+\varepsilon t)-F(\cdot)]\to(\nabla F(\cdot),t)$, $\varepsilon\to0$, $\varkappa$-a.s. We say that F is direction-wise differentiable in the $L_{p,loc}(\varkappa)$
sense if $F\in L_{p,loc}(\varkappa)$ and, for every $t\in\mathbb{R}^m$, $\frac{1}{\varepsilon}[F(\cdot+\varepsilon t)-F(\cdot)]\to(\nabla F(\cdot),t)$, $\varepsilon\to0$, in $L_{p,loc}(\varkappa)$.

Proposition 2.9. 1. Let $\varkappa(dx) = p(x)\lambda^m(dx)$ with $p(x)\ge C_R>0$, $\|x\|\le R$, for any R > 0. Then

every function F that is direction-wise $\varkappa$-a.s. differentiable is also direction-wise $\lambda^m$-a.s. differentiable, and
every function F that is direction-wise differentiable in the $L_{p,loc}(\varkappa)$ sense is also direction-wise differentiable in the
$L_{p,loc}(\lambda^m)$ sense. The function $\nabla F$ from the definition of $\varkappa$-differentiability (either in the a.s. or in the $L_{p,loc}$ sense)
$\lambda^m$-a.s. coincides with the one from the definition of $\lambda^m$-differentiability.

2. If F is direction-wise $L_{p,loc}(\lambda^m)$-differentiable, then $F\in W^1_{p,loc}(\mathbb{R}^m,\mathbb{R}^m)$ and $\nabla F$ coincides with its
Sobolev derivative.

3. If F is direction-wise $\lambda^m$-a.s. differentiable, then F has an approximative derivative at $\lambda^m$-almost all points
$x\in\mathbb{R}^m$, and $\nabla F$ coincides with its approximative derivative.

Proof. Statements 1 and 2 immediately follow from the definition. Statement 3 follows from Theorem
3.1.4 [11] and the trivial fact that the usual differentiability at some point w.r.t. a given direction implies
approximative differentiability at the same point w.r.t. this direction.

Now, we can finish the proof of Lemma 2.5. Denote $f_{n,\varpi} = [f_n]_\varpi$, $f_\varpi = [f]_\varpi$. By the construction,
$[T_r^{\mathcal{G}}f_n]_\varpi(s) = [f_n]_\varpi(s+r)$, $s,r\in\mathbb{R}^m$, $\varpi\in\tilde\Omega$. This, together with (2.9) and Lemma 2.4, provides that
there exists $\tilde\Omega_0\subset\tilde\Omega$ with $\pi(\tilde\Omega\setminus\tilde\Omega_0)=0$ such that, for every $\varpi\in\tilde\Omega_0$, (2.11) holds true, and the functions
$f_{n,\varpi}, f_\varpi$ are either direction-wise $P_\varpi$-a.s. differentiable (in the case A), or direction-wise differentiable in the
$L_{p,loc}(P_\varpi)$ sense (in the case B), and $\nabla f_{n,\varpi} = [Y^{\mathcal{G}}(y_n,t)]_\varpi$, $\nabla f_\varpi = [Y^{\mathcal{G}}(x,t)]_\varpi$. This, in particular, means
that $[\Omega_{x,t}]_\varpi = \{\det\nabla f_\varpi\neq0\}$ for $\varpi\in\tilde\Omega_0$.

Now, we can apply the standard theorem on $L_p$-continuity of the solution to an SDE w.r.t. the initial condition
(see Theorem 4, Chapter 4.2 [12]), and obtain that $f_n\to f$, $Y^{\mathcal{G}}(y_n,t)\to Y^{\mathcal{G}}(x,t)$, $n\to\infty$, in the $L_p$ sense.
This implies that, for $\pi$-almost all $\varpi\in\tilde\Omega_0$, $f_{n,\varpi}\to f_\varpi$, $\nabla f_{n,\varpi}\to\nabla f_\varpi$, $n\to\infty$, in the $L_p(P_\varpi)$ sense, and,
therefore, in the $L_{p,loc}(\lambda^m)$ sense. In the case B, convergence (2.12) (and thus the statement of the lemma)
follows straightforwardly from statement I of Proposition 2.7. In the case A, convergence (2.12) follows
from statement II of the same Proposition and Lemma 3.3 [17], which provides that, for $\pi$-almost all $\varpi\in\tilde\Omega$,
the sequence $\{f_{n,\varpi}\}$ is uniformly approximatively Lipschitz on $\mathbb{R}^m$. The lemma is proved.

3. Proofs of the main results 

3.1. Proof of Theorem 1.3. We prove Theorem 1.3 in two steps. First, we use Lemma 2.5 and show that,
under condition N, the local Doeblin condition holds true inside some small ball.

Lemma 3.1. Under condition N, there exists £* > 0, such that 
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(3.1) inf / [P*'Alf](dz)>0. 

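Proof. Suppose that the grid $S$ is such that, in the notation of Lemma 2.5,

(3.2) $P(\Omega^S_{x_*,t_*}) > 0,$

and denote $P_{x_*,x}(dz) = P(\Omega^S_{x_*,t_*},\ X(x,t_*) \in dz)$. One can see that $P_{x_*,x}(dz) = p_{x_*,x}(z) P_x^{t_*}(dz)$ with $p_{x_*,x} \le 1$, and therefore

$\int_{\mathbb{R}^m} [P_{x_*,x} \wedge P_{x_*,y}](dz) \le \int_{\mathbb{R}^m} [P_x^{t_*} \wedge P_y^{t_*}](dz), \quad x,y \in \mathbb{R}^m.$

Thus, in order to prove (3.1), it is enough to prove that

$\inf_{x,y \in B(x_*,\varepsilon_*)} \int_{\mathbb{R}^m} [P_{x_*,x} \wedge P_{x_*,y}](dz) > 0.$

The latter inequality follows from condition (3.2), Lemma 2.5 and the relation

$\int_{\mathbb{R}^m} [P_{x_*,x} \wedge P_{x_*,y}](dz) = P(\Omega^S_{x_*,t_*}) - \frac{1}{2}\|P_{x_*,x}(\cdot) - P_{x_*,y}(\cdot)\|_{var} \to P(\Omega^S_{x_*,t_*}), \quad x,y \to x_*.$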

Thus, the only thing left to show is that, under condition N, the grid $S$ can be chosen in such a way that (3.2) holds true. Denote by $\mathfrak{J}_m$ the family of all rational partitions of $(0,t_*)$ of the length $2m$; any $J \in \mathfrak{J}_m$ is a set of the type

$J = \{a_1, b_1, a_2, b_2, \dots, a_m, b_m\}$, with $0 < a_1 < b_1 < \dots < a_m < b_m < t_*$, $a_i, b_i \in \mathbb{Q}$, $i = 1,\dots,m$.

Denote, for a set $J$ of such a type and $r \in \mathbb{N}$,

$\Omega_{J,r} = \Big\{ \forall i = 1,\dots,m\ \exists!\, \tau_i \in (a_i,b_i) \cap \mathcal{D}:\ \|p(\tau_i)\| \ge \frac{1}{r}$, and $\mathrm{span}\{\mathcal{E}_{\tau_i}^{t_*} \Delta(X(x_*,\tau_i-), p(\tau_i)),\ i = 1,\dots,m\} = \mathbb{R}^m \Big\}.$

Then, elementary considerations show that

$\Omega_{x_*,t_*} = \bigcup_{J \in \mathfrak{J}_m,\ r \in \mathbb{N}} \Omega_{J,r}.$

Therefore, under condition N, there exist $J_* = \{a_1^*, b_1^*, \dots, a_m^*, b_m^*\}$ and $r_* \in \mathbb{N}$ such that $P(\Omega_{J_*,r_*}) > 0$. Let $h_i^* \in H_0$, $i = 1,\dots,m$, be arbitrary functions such that, for any $i$, $Jh_i^* > 0$ inside $(a_i^*,b_i^*)$ and $Jh_i^* = 0$ outside $(a_i^*,b_i^*)$. Consider the grid $S_*$ with

$\Gamma_i = \{u: \|u\| \ge \frac{1}{r_*}\}$ for $i \le m$, $\Gamma_i = \emptyset$ for $i > m$; $a_i = a_i^*$, $b_i = b_i^*$, $h_i = h_i^*$ for $i \le m$; $a_i, b_i, h_i$ are arbitrary for $i > m$.

Then formula (2.6) shows that, on the set $\Omega_{J_*,r_*}$,

$Y^{S_*,i}(x_*,t_*) = Y_{h_i}^{\Gamma_i}(x_*,t_*) = Jh_i(\tau_i)\, \mathcal{E}_{\tau_i}^{t_*} \Delta(X(x_*,\tau_i-), p(\tau_i)), \quad i = 1,\dots,m.$

Since $Jh_i(\tau_i) \ne 0$ by the construction, the vectors $\{Y^{S_*,i}(x_*,t_*)\}$ are linearly independent iff so are the vectors $\{\mathcal{E}_{\tau_i}^{t_*} \Delta(X(x_*,\tau_i-), p(\tau_i))\}$. Thus,

$P(\Omega^{S_*}_{x_*,t_*}) = P(\det Y^{S_*}(x_*,t_*) \ne 0) \ge P(\Omega_{J_*,r_*}) > 0,$

that gives (3.2). The lemma is proved.

The last step in the proof of Theorem 1.3 is to combine the statement of the previous lemma with condition S and show that the local Doeblin condition holds true in any bounded region of $\mathbb{R}^m$.

Lemma 3.2. Under conditions N and S, condition LD holds true with

$T(R) = t(R) + t_*, \quad R > 0.$
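Proof. The process $X$ is a Feller one; this follows, for instance, from Theorem 4, Chapter 4.2 [12]. Therefore, the function $x \mapsto P_x^t(O)$ is lower semicontinuous for any open set $O$ and any $t > 0$. This, together with condition S, provides that, for any $R > 0$,

$\delta(R) = \inf_{\|x\| \le R} P_x^{t(R)}(B(x_*,\varepsilon_*)) > 0.$

Denote

$\gamma_* = \inf_{x,y \in B(x_*,\varepsilon_*)} \int_{\mathbb{R}^m} [P_x^{t_*} \wedge P_y^{t_*}](dz) = \inf_{x,y \in B(x_*,\varepsilon_*)} \Big[1 - \frac{1}{2}\|P_x^{t_*} - P_y^{t_*}\|_{var}\Big] > 0,$

then, for any $x,y \in B(x_*,\varepsilon_*)$ and any $A \in \mathcal{B}(\mathbb{R}^m)$,

$P_x^{t_*}(A) + P_y^{t_*}(\mathbb{R}^m \setminus A) \le 1 + \frac{1}{2}\|P_x^{t_*} - P_y^{t_*}\|_{var} \le 2 - \gamma_*.$

Take some $R > 0$ and denote $T = T(R) = t(R) + t_*$. Take two independent processes $X_1, X_2$, satisfying equations of the type (0.1) with independent point measures $\nu_1, \nu_2$ and starting from the points $x, y$. Then, for $x,y \in B(0,R)$ and any given $A \in \mathcal{B}(\mathbb{R}^m)$, we can write

$P_x^T(A) + P_y^T(\mathbb{R}^m \setminus A) = E\Big[P^{t_*}_{X_1(t(R))}(A) + P^{t_*}_{X_2(t(R))}(\mathbb{R}^m \setminus A)\Big]$
$\le E\Big[2 \cdot 1_{\{X_1(t(R)) \notin B(x_*,\varepsilon_*)\} \cup \{X_2(t(R)) \notin B(x_*,\varepsilon_*)\}} + (2 - \gamma_*) 1_{\{X_1(t(R)),\, X_2(t(R)) \in B(x_*,\varepsilon_*)\}}\Big] \le 2 - \gamma_* \delta^2(R).$

Therefore

$\inf_{\|x\|,\|y\| \le R} \int_{\mathbb{R}^m} [P_x^T \wedge P_y^T](dz) = 1 - \frac{1}{2} \sup_{\|x\|,\|y\| \le R} \|P_x^T - P_y^T\|_{var} \ge \gamma_* \delta^2(R) > 0.$

This completes the proof of Lemma 3.2 and Theorem 1.3.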

3.2. Proofs of Theorems 1.1, 1.2. Statements close to those of Theorems 1.1, 1.2 are well known in different settings, and there exist several well developed ways to prove statements of this kind. For instance, the statement of Theorem 1.1 can be derived straightforwardly from Theorems 5.1, 6.1 [35], since condition LD provides that, for any time-discretized process $X^\Delta = \{X(k\Delta), k \in \mathbb{Z}_+\}$ (the so-called $\Delta$-skeleton chain), any compact set is a petite set. However, it is difficult to obtain in this way explicit expressions (or estimates) for the constants $C_1, C_2$ involved in the principal estimates (1.3), (1.4). Therefore, we use another way to prove (1.3), (1.4), based on the coupling technique. In general, we follow the scheme of the proof proposed for diffusion processes in [27], [28], but our construction of the coupling slightly differs from the one used there. This allows us to exclude from the construction auxiliary conditions, such as the Harnack inequality used in [27] or condition (T) used in [28], that are unnatural and restrictive in the context of SDE's with a jump noise.

Let us start with the construction of the coupling used in the proof. Since various coupling constructions are widely used in the literature, we restrict our exposition to a sketch of the construction only, and omit technical details. First, let us give two basic "bricks" of our construction. Everywhere below we call "coupling" any $\mathbb{R}^m \times \mathbb{R}^m$-valued process $Y = (Y^1, Y^2)$ such that the laws of $Y^1, Y^2$ coincide with $P_{\mu_1}, P_{\mu_2}$ with some given $\mu_1, \mu_2 \in \mathcal{P}$.

1. Simple coupling. We call $Y = (Y^1, Y^2)$ a simple coupling with a starting point $y = (y_1,y_2) \in \mathbb{R}^{2m}$, if it is a coupling with $\mu_{1,2} = \delta_{y_{1,2}}$ and the processes $Y^1, Y^2$ are

(a) independent, if $y_1 \ne y_2$;

(b) equal one to another, if $y_1 = y_2$.

In order to show that such a process exists one should simply consider two equations of the type (0.1) with random point measures $\nu_1, \nu_2$ that are either independent in the case $y_1 \ne y_2$, or equal one to another in the case $y_1 = y_2$.

2. Gluing coupling. We call $Y = (Y^1, Y^2)$ a gluing coupling with a starting point $y = (y_1,y_2) \in \mathbb{R}^{2m}$ and terminal time $T > 0$, if it is a coupling with $\mu_{1,2} = \delta_{y_{1,2}}$, and

(a) $Y^1 = Y^2$, if $y_1 = y_2$;

(b) $P(Y^1(T) = Y^2(T)) = \int_{\mathbb{R}^m} [P_{y_1}^T \wedge P_{y_2}^T](dz)$, if $y_1 \ne y_2$.

One can show that such a process exists in the following way (one needs to consider the case $y_1 \ne y_2$ only). First, due to the standard Coupling lemma (also called Dobrushin lemma, see [10]), there exists a probability measure $\varkappa$ on $\mathbb{R}^m \times \mathbb{R}^m$ with the marginal distributions $P_{y_1}^T, P_{y_2}^T$ such that

$\varkappa(\{z = (z_1,z_2) \mid z_1 = z_2\}) = \int_{\mathbb{R}^m} [P_{y_1}^T \wedge P_{y_2}^T](dz).$

Then, this measure is considered as the distribution of $Y$ at the moment $T$, and the distribution of the whole trajectory of $Y$ is defined by this measure and the family of conditional distributions $\{P(Y \in \cdot \mid Y(T) = z),\ z \in \mathbb{R}^m \times \mathbb{R}^m\}$ (such a construction is correct since $D(\mathbb{R}_+, \mathbb{R}^m \times \mathbb{R}^m)$ is a Borel measurable space). Any conditional distribution $P(Y \in \cdot \mid Y(T) = z)$, $z = (z_1,z_2) \in \mathbb{R}^m \times \mathbb{R}^m$, can be constructed, for instance, as the product of the measures

$P(X \in \cdot \mid X(0) = y_1, X(T) = z_1), \quad P(X \in \cdot \mid X(0) = y_2, X(T) = z_2).$
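The Coupling lemma used above can be illustrated on a finite state space. The following is a minimal sketch, not the construction of the paper: the function name `maximal_coupling` and the representation of measures as dictionaries are assumptions made for illustration only. It builds a joint law with prescribed marginals whose diagonal mass equals the overlap $\sum_k \min(p_k, q_k) = 1 - \frac{1}{2}\|p - q\|_{var}$.

```python
from fractions import Fraction


def maximal_coupling(p, q):
    """Joint pmf on pairs (i, j) with marginals p and q.

    p, q: dicts state -> probability (hypothetical finite-state stand-ins
    for the laws P^T_{y_1}, P^T_{y_2}).
    Returns (joint, gamma), where gamma = sum_k min(p[k], q[k]) is the
    mass that the joint law puts on the diagonal {i == j}.
    """
    keys = sorted(set(p) | set(q))
    overlap = {k: min(p.get(k, 0), q.get(k, 0)) for k in keys}
    gamma = sum(overlap.values())
    # Common part of the two measures is placed on the diagonal.
    joint = {(k, k): m for k, m in overlap.items() if m > 0}
    if gamma < 1:
        # Residual parts are mutually singular; couple them independently.
        rest_p = {k: p.get(k, 0) - overlap[k] for k in keys}
        rest_q = {k: q.get(k, 0) - overlap[k] for k in keys}
        for i in keys:
            for j in keys:
                mass = rest_p[i] * rest_q[j] / (1 - gamma)
                if mass > 0:
                    joint[(i, j)] = joint.get((i, j), 0) + mass
    return joint, gamma


p = {0: Fraction(1, 2), 1: Fraction(1, 2)}
q = {1: Fraction(1, 4), 2: Fraction(3, 4)}
joint, gamma = maximal_coupling(p, q)
```

Exact rational arithmetic is used so that the marginals and the diagonal mass can be compared without rounding error.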

Both simple and gluing couplings can be constructed simultaneously on one probability space $(\Omega_*, \mathcal{F}_*, P_*)$ for all $y = (y_1,y_2) \in \mathbb{R}^m \times \mathbb{R}^m$ in a way that is jointly measurable in the probability variable $\omega$ and the space variable $y$. For the simple coupling this follows from the standard theorem on a measurable modification (note that, by the construction, this coupling is continuous in probability w.r.t. $y$ on the sets $\{y_1 \ne y_2\}$ and $\{y_1 = y_2\}$). For the gluing coupling one can verify this using the lemma about three random variables ([29]). Further we denote both these couplings with a starting point $y = (y_1,y_2)$ by $Y^{y_1,y_2}$.

Now, we can describe our construction. We fix $T, R > 0$, that will be defined later. Construct the probability space $(\Omega, \mathcal{F}, P)$ as an infinite product

$\Omega = \Omega_0 \times \prod_{k=1}^{\infty} \Omega_k, \quad \mathcal{F} = \mathcal{F}_0 \otimes \bigotimes_{k=1}^{\infty} \mathcal{F}_k, \quad P = P_0 \times \prod_{k=1}^{\infty} P_k,$

where, for $k \ge 1$, $(\Omega_k, \mathcal{F}_k, P_k) = (\Omega_*, \mathcal{F}_*, P_*)$. Given $\mu_1, \mu_2 \in \mathcal{P}$, construct on $(\Omega_0, \mathcal{F}_0, P_0)$ two independent $\mathbb{R}^m$-valued elements $Z_0^{1,2}$ with $\mathrm{Law}(Z_0^{1,2}) = \mu_{1,2}$. Next, consider the simple coupling $\{Y_1^{y_1,y_2}, (y_1,y_2) \in \mathbb{R}^m \times \mathbb{R}^m\}$ that is defined on $(\Omega_1, \mathcal{F}_1, P_1)$, and consider the process

$Z_1(t) = Y_1^{y_1,y_2}(t)\big|_{y_1 = Z_0^1,\ y_2 = Z_0^2}, \quad t \ge 0.$

Denote

$Q_1 = \inf\{t: \|Z_1^1(t)\| \le R,\ \|Z_1^2(t)\| \le R\}.$

Consider the gluing coupling $\{Y_2^{y_1,y_2}, (y_1,y_2) \in \mathbb{R}^m \times \mathbb{R}^m\}$ that is defined on $(\Omega_2, \mathcal{F}_2, P_2)$, and consider the process

$Z_2(t) = Y_2^{y_1,y_2}(t - Q_1)\big|_{y_1 = Z_1^1(Q_1),\ y_2 = Z_1^2(Q_1)}, \quad t \ge Q_1.$

Denote $Q_2 = Q_1 + T$. Repeat this construction iteratively: take the next "independent copy" of the simple coupling, substitute the terminal value $Z_2(Q_2)$ as the starting point in it, and wait till the random moment $Q_3$ when both its coordinates appear inside the ball $\{\|x\| \le R\}$. Then take the next "independent copy" of the gluing coupling, substitute the terminal value $Z_3(Q_3)$ as the starting point in it, wait till the moment $Q_4 = Q_3 + T$, and so on. Define the process $\{Y(t) = (Y^1(t), Y^2(t)), t \in \mathbb{R}_+\}$ by $Y(t) = Z_k(t)$, $t \in [Q_{k-1}, Q_k)$ (with $Q_0 = 0$); below we call this process "switching coupling". It has the following properties by the construction:

(i) $\mathrm{Law}(Y^i(\cdot)) = P_{\mu_i}$, $i = 1,2$;

(ii) for any $k \in \mathbb{N}$, $Y^1(t) = Y^2(t)$, $t \ge Q_k$, as soon as $Y^1(Q_k) = Y^2(Q_k)$;

(iii) $P\big(Y^1(Q_{2k}) = Y^2(Q_{2k}) \mid Y^1(Q_{2k-1}) \ne Y^2(Q_{2k-1})\big) \ge \inf_{\|x\|,\|y\| \le R} \int_{\mathbb{R}^m} [P_x^T \wedge P_y^T](dz)$, $k \in \mathbb{N}$.

Denote $k_* = \min\{k: Y^1(Q_k) = Y^2(Q_k)\}$ and put $Q_* = Q_{k_*}$; $Q_*$ is the "gluing moment" for the coordinates of the switching coupling $Y$. Let us give some estimates that, together with property (iii) and condition LD, allow one to control the tail probabilities for $Q_*$. Everywhere below, we suppose that $\phi > 0$. This does not restrict generality, since one can replace $\phi$ by $\phi + C$ with a properly chosen constant $C$.

Fix some $c \in (0,1)$ and take $R > 0$ such that $\phi(x) \ge \max\big[\frac{\gamma}{c\alpha}, 1\big]$ for $\|x\| > R$. Denote $L = \inf\{t: \|X(t)\| \le R\}$.

Lemma 3.3. For any $\mu \in \mathcal{P}$ with $\phi(\mu) < +\infty$,

(a) $\phi(P_\mu^t) \le \phi(\mu) \exp[-\alpha t] + \frac{\gamma}{\alpha}$;

(b) $P_\mu(L > t) \le \phi(\mu) \exp[-(1-c)\alpha t]$.

Proof. Inequality (a) is a standard corollary of the Dynkin formula and the condition $A\phi \le -\alpha\phi + \gamma$ (see, for instance, the beginning of the proof of Theorem 6.1 [12]). Consider, together with the process $X$, the process $\tilde X(\cdot) = X(\cdot \wedge L)$ (i.e., the process $X$ stopped at the first moment of its visit to the ball $\{\|x\| \le R\}$). By the construction, its (extended) generator $\tilde A$ satisfies, for $\|x\| > R$, the condition

$\tilde A\phi(x) = A\phi(x) \le -\alpha\phi(x) + \gamma \le -(1-c)\alpha\phi(x).$

Then, writing down the relation analogous to (a) for the process $\tilde X$, we obtain that

$E_\mu \phi(\tilde X(t)) 1_{\|\tilde X(t)\| > R} \le \int_{\|x\| > R} \phi(x)\mu(dx) \cdot \exp[-(1-c)\alpha t] \le \phi(\mu) \exp[-(1-c)\alpha t].$

Since, by the construction, $\phi(x) \ge 1$ for $\|x\| > R$, this implies (b). The lemma is proved.
Corollary 3.4. There exists an invariant measure $\mu_*$ for $X$ such that $\phi(\mu_*) \le \frac{\gamma}{\alpha}$.

Proof. Take some $\mu \in \mathcal{P}$ with $\phi(\mu) < +\infty$ and consider the family of measures $\{\mu^t, t \in \mathbb{R}_+\}$,

$\mu^t = \frac{1}{t}\int_0^t P_\mu^s\, ds$

(the so-called Khasminskii's averages). It follows from (a) that $\limsup_{t \to +\infty} \phi(\mu^t) \le \frac{\gamma}{\alpha}$ and this, together with the Fatou lemma, provides that

(i) the family $\{\mu^t, t \in \mathbb{R}_+\}$ possesses some weak partial limit $\mu_*$ as $t \to +\infty$;

(ii) $\phi(\mu_*) \le \frac{\gamma}{\alpha}$.

Moreover, the weak partial limit $\mu_*$ is an invariant measure for $X$ (the proof of this fact is simple and standard, so we omit the detailed exposition here). This completes the proof.

Estimates of Lemma 3.3 can be extended from the process $X$ to the coupling $Y = (Y^1, Y^2)$. The only delicate point here is that $Y$ does not have to be a Markov process, so we need some accuracy in writing down the analogues of the estimates (a), (b). Denote

$\psi(y) = \phi(y_1) + \phi(y_2)$ and $\|y\|_\infty = \max[\|y_1\|, \|y_2\|]$, $y = (y_1,y_2) \in \mathbb{R}^m \times \mathbb{R}^m$,

then $\psi(y) \to +\infty$, $\|y\|_\infty \to +\infty$. Take $R > 0$ such that $\psi(y) \ge \max\big[\frac{2\gamma}{c\alpha}, 1\big]$ for $\|y\|_\infty > R$. Denote $L = \inf\{t: \|Y(t)\|_\infty \le R\}$.

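Lemma 3.5. (a) Let $\mu_1, \mu_2 \in \mathcal{P}$ with $\phi(\mu_1), \phi(\mu_2) < +\infty$, and $Y$ be an arbitrary coupling with $\mathrm{Law}(Y^{1,2}) = P_{\mu_{1,2}}$. Then

$E\psi(Y(t)) \le [\phi(\mu_1) + \phi(\mu_2)] \exp[-\alpha t] + \frac{2\gamma}{\alpha}.$

(b) Let $\mu_1, \mu_2 \in \mathcal{P}$ with $\phi(\mu_1), \phi(\mu_2) < +\infty$, and $Y$ be the simple coupling with $\mathrm{Law}(Y^{1,2}) = P_{\mu_{1,2}}$. Then

$P(L > t) \le 2[\phi(\mu_1) + \phi(\mu_2)] \exp[-(1-c)\alpha t].$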

Proof. The first statement follows immediately from Lemma 3.3 and the equality $E\psi(Y(t)) = E_{\mu_1}\phi(X(t)) + E_{\mu_2}\phi(X(t))$. In order to prove (b), one should consider separately the cases $Y^1(0) = Y^2(0)$ and $Y^1(0) \ne Y^2(0)$. In the first case, both coordinates $Y^1, Y^2$ are the same and move like the process $X$, i.e., statement (b) of Lemma 3.3 implies that

$P(L > t,\ Y^1(0) = Y^2(0)) \le [\phi(\mu_1) + \phi(\mu_2)] \exp[-(1-c)\alpha t].$

In the second case, the joint dynamics of the coordinates $Y^1, Y^2$ is described by a Markov process, and the generator $\mathbb{A}$ of this process satisfies the relation

$\mathbb{A}\psi(y) = A\phi(y_1) + A\phi(y_2) \le -\alpha\psi(y) + 2\gamma, \quad y = (y_1,y_2) \in \mathbb{R}^m \times \mathbb{R}^m.$

Applying the same arguments as those used in the proof of Lemma 3.3, we obtain that

$P(L > t,\ Y^1(0) \ne Y^2(0)) \le [\phi(\mu_1) + \phi(\mu_2)] \exp[-(1-c)\alpha t].$

This provides the needed estimate. The lemma is proved.
Using the Hölder inequality, we obtain that

(3.3) $P(Q_* > t) = \sum_{k=0}^{\infty} P\big(Q_* > t,\ t \in [Q_{2k}, Q_{2k+2})\big) \le \sum_{k=0}^{\infty} \big[P(Y^1(Q_{2k}) \ne Y^2(Q_{2k}))\big]^{1/2} \big[P(Q_{2k+2} > t)\big]^{1/2}.$

Denote

$\delta(T,R) = \inf_{\|x\|,\|y\| \le R} \int_{\mathbb{R}^m} [P_x^T \wedge P_y^T](dz).$

By the construction, given the values of the process $Y$ up to the moment $Q_{2k-2}$, the conditional probability of the event $\{Y^1(Q_{2k}) \ne Y^2(Q_{2k})\}$ is not greater than $1 - \delta(T,R)$. Thus, we have an estimate

(3.4) $P(Y^1(Q_{2k}) \ne Y^2(Q_{2k})) \le (1 - \delta(T,R))^k.$

Now let, in the construction of the switching coupling $Y$ above, $R$ be taken equal to the $R$ given prior to Lemma 3.5 and $T = T(R)$ (see the notation in condition LD). Then $\delta(T,R) > 0$, and (3.4) gives an exponential (w.r.t. $k$) estimate for $P(Y^1(Q_{2k}) \ne Y^2(Q_{2k}))$. Next, $P(Q_{2k+2} > t)$ can be estimated in the following way. Denote $\Delta_k = Q_{k+1} - Q_k$, $\mathcal{G}_k = \mathcal{F}_{Q_k}$, $k \ge 0$, where $\{\mathcal{F}_t, t \in \mathbb{R}_+\}$ is the filtration generated by $Y$. Then, for every $k \ge 1$, $\Delta_{2k-1} = T$ and, from the construction of the switching coupling $Y$ and statement (b) of Lemma 3.5, we have that

(3.5) $P(\Delta_{2k} > t \mid \mathcal{G}_{2k}) \le 2[\phi(Y^1(Q_{2k})) + \phi(Y^2(Q_{2k}))] \exp[-(1-c)\alpha t], \quad k \ge 0.$

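From statement (a) of Lemma 3.5, applied to $t = T$ and $\mu_{1,2} = \mathrm{Law}(Y^{1,2}(Q_{2k-1}))$, we obtain that

$E\big[\phi(Y^1(Q_{2k})) + \phi(Y^2(Q_{2k})) \mid \mathcal{G}_{2k-1}\big] \le \frac{2\gamma}{\alpha} + 2\sup_{\|x\| \le R} \phi(x), \quad k \ge 1,$

here we used that $\|Y^{1,2}(Q_{2k-1})\| \le R$. This, together with (3.5), gives that

$P(\Delta_{2k} > t \mid \mathcal{G}_{2k-1}) \le \Big[\frac{4\gamma}{\alpha} + 4\sup_{\|x\| \le R} \phi(x)\Big] \exp[-(1-c)\alpha t], \quad k \ge 1,$

and, consequently,

$E\Big(\exp\Big[\frac{(1-c)\alpha}{2}\Delta_{2k}\Big] \,\Big|\, \mathcal{G}_{2k-1}\Big) \le \frac{4\gamma}{\alpha} + 4\sup_{\|x\| \le R} \phi(x), \quad k \ge 1.$

Analogously, we have that

$E\exp\Big[\frac{(1-c)\alpha}{2}\Delta_0\Big] \le 2[\phi(\mu_1) + \phi(\mu_2)].$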

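At last, $\Delta_{2k+1} = T$, $k \ge 0$. This and the two previous estimates provide that

(3.6) $E\exp\Big[\frac{(1-c)\alpha}{2} Q_{2k+2}\Big] = E\prod_{j=0}^{2k+1} \exp\Big[\frac{(1-c)\alpha}{2}\Delta_j\Big] \le 2[\phi(\mu_1) + \phi(\mu_2)] \cdot \exp\Big[(k+1)\frac{(1-c)\alpha}{2}T\Big] \cdot \Big[\frac{4\gamma}{\alpha} + 4\sup_{\|x\| \le R}\phi(x)\Big]^k$
$= 2\exp\Big[\frac{(1-c)\alpha}{2}T\Big][\phi(\mu_1) + \phi(\mu_2)] \cdot \exp[kD], \quad k \ge 0,$

with $D = \frac{(1-c)\alpha}{2}T + \ln\Big[\frac{4\gamma}{\alpha} + 4\sup_{\|x\| \le R}\phi(x)\Big]$.

Take $p = \max\Big[1, -\frac{2D}{\ln(1 - \delta(T,R))}\Big]$; then, by the Hölder inequality,

$E\exp\Big[\frac{(1-c)\alpha}{2p} Q_{2k+2}\Big] \le \max\Big\{2\exp\Big[\frac{(1-c)\alpha}{2}T\Big][\phi(\mu_1) + \phi(\mu_2)],\ 1\Big\} \cdot (1 - \delta(T,R))^{-k/2},$

and, by the Chebyshev inequality,

$P(Q_{2k+2} > t) \le \exp\Big[-\frac{(1-c)\alpha}{2p}t\Big] \cdot \max\Big\{2\exp\Big[\frac{(1-c)\alpha}{2}T\Big][\phi(\mu_1) + \phi(\mu_2)],\ 1\Big\} \cdot (1 - \delta(T,R))^{-k/2}.$

This estimate, together with (3.3), (3.4), gives that

$P(Q_* > t) \le \max\Big\{2\exp\Big[\frac{(1-c)\alpha}{2}T\Big][\phi(\mu_1) + \phi(\mu_2)],\ 1\Big\} \cdot \Big[1 - (1 - \delta(T,R))^{1/4}\Big]^{-1} \cdot \exp\Big[-\frac{(1-c)\alpha}{4p}t\Big], \quad t \in \mathbb{R}_+.$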

Now, we can use standard arguments (see [37]) and complete the proofs of Theorems 1.1, 1.2. In order to prove (1.3), consider the switching coupling with $\mu_1 = \mu$ and $\mu_2 = \mu_*$ given by Corollary 3.4. Then



< P{Q* >t)< Ci[^(/i) + 1] exp[-C 2 <], t e 



with Ci = max 



2 1 



2exp[if£T]- l-(l-5(T,i?)): 



Analogously, for a given s, t G M + consider 

the switching coupling with — and /i 2 = -FVj. Then, by statement (a) of Lemma 13.31 0(/i 2 ) < '/K/ 1 ) + 
and 

E M IIPm(-I^o) - P/.(0L„ r ,s» J < >*) < C i[^(M) + l]exp[-C 2 t], i G M+ 

with $\tilde C_1 = 2C_1$. These two estimates imply (1.3), (1.4) with $\mu_{inv} = \mu_*$. Theorems 1.1, 1.2 are proved.

4. Sufficient conditions. 

In this section, we give some sufficient conditions for R,N,S to hold true. 

4.1. Condition R. There exists a wide range of conditions, that are sufficient for the recurrence condition 
R, see [20], Section 2.3. Here, we give only one condition of such a type, that is an analogue of Lemma 2.4 
[217] . but with a condition (11) of this Lemma replaced by an essentially weaker one (condition 3 below). 

Proposition 4.1. Suppose that the following conditions hold true.

1. There exist $R, \alpha > 0$ such that

$(a(x), x)_{\mathbb{R}^m} \le -\alpha\|x\|^2, \quad \|x\| \ge R.$

2. There exists $q \in (0,+\infty)$ such that

$\int_{\|u\| > 1} \|u\|^q\, \Pi(du) < +\infty.$

3. The function $c$ can be decomposed into a sum $c = c_1 + c_2$ with $c_1, c_2$ such that

3a. for some function $\psi$ with $\frac{\psi(x)}{\|x\|} \to 0$ as $\|x\| \to \infty$,

$\|c_1(x,u)\| \le \psi(x)\|u\|, \quad u \in \mathbb{R}^d,\ x \in \mathbb{R}^m;$

3b. $\|x + c_2(x,u)\| \le \|x\|$, $x \in \mathbb{R}^m$, $\|u\| > 1$; $c_2(\cdot,u) = 0$, $\|u\| \le 1$.

Then condition R holds true.

Remark. In the case B, condition 3 holds true automatically with $c_1 = c$, $c_2 = 0$.

Proof. Consider $\phi \in C^2(\mathbb{R}^m)$ such that $\phi(x) = \|x\|^q$, $\|x\| \ge R$. Without losing generality we can suppose that the constant $R$ is chosen in such a way that $\delta_r \equiv \sup_{\|x\| \ge r} \frac{\psi(x)}{\|x\|} < \frac{1}{2}$, $r \ge R$. Then, for $\|x\| \ge 2R$,

$A\phi(x) = q(a(x),x)_{\mathbb{R}^m}\|x\|^{q-2} + \int_{\mathbb{R}^d} \Big[\|x + c(x,u)\|^q - \|x\|^q - q 1_{\|u\| \le 1}(c(x,u),x)_{\mathbb{R}^m}\|x\|^{q-2}\Big]\Pi(du)$
$= q(a(x),x)_{\mathbb{R}^m}\|x\|^{q-2} + \int_{\|u\| \le 1} \Big[\|x + c_1(x,u)\|^q - \|x\|^q - q(c_1(x,u),x)_{\mathbb{R}^m}\|x\|^{q-2}\Big]\Pi(du) + \int_{\|u\| > 1} \Big[\|x + c(x,u)\|^q - \|x\|^q\Big]\Pi(du).$

Due to condition 3b,

$\int_{\|u\| > 1} \Big[\|x + c(x,u)\|^q - \|x\|^q\Big]\Pi(du) \le \int_{\|u\| > 1} \Big[\|x + c(x,u)\|^q - \|x + c_2(x,u)\|^q\Big]\Pi(du) = \int_{\|u\| > 1} \Big[\|x(u) + c_1(x,u)\|^q - \|x(u)\|^q\Big]\Pi(du),$

where $x(u) = x + c_2(x,u)$. If $O$ is an open subset of $\mathbb{R}^m$, $\{x + sc, s \in [0,1]\} \subset O$ and $\phi \in C^2(O)$, then

(4.1) $|\phi(x + c) - \phi(x) - (\nabla\phi(x), c)_{\mathbb{R}^m}| \le \|c\|^2 \sup_{s \in [0,1]} \|[\nabla^2\phi](x + sc)\|$

(the Taylor formula). Then, for $r > 2R$,

$\int_{\|u\| \le 1} \Big[\|x + c_1(x,u)\|^q - \|x\|^q - q(c_1(x,u),x)_{\mathbb{R}^m}\|x\|^{q-2}\Big]\Pi(du) \le \mathrm{const} \cdot [\|x\|(1 - \delta_r)]^{q-2}[\|x\|\delta_r]^2, \quad \|x\| > r,$

here we applied (4.1) with $\phi(x) = \|x\|^q$, $O = \{\|x\| > r(1 - \delta_r)\}$ and $c = c_1(x,u)$.

If $q \ge 1$, then, applying the inequality $|\phi(x + c) - \phi(x)| \le \|c\| \sup_{s \in [0,1]} \|[\nabla\phi](x + sc)\|$ with the same $\phi$, $x = x(u)$ and $c = c_1(x,u)$, we obtain analogously

$\int_{\|u\| > 1} \Big[\|x(u) + c_1(x,u)\|^q - \|x(u)\|^q\Big]\Pi(du) \le \mathrm{const} \cdot [\|x\|(1 - \delta_r)]^{q-1}[\|x\|\delta_r], \quad \|x\| > r,$

here we used that $\|x(u)\| \le \|x\|$. If $q < 1$, then we apply the inequality $a^q + b^q \ge (a + b)^q$, $a,b \in \mathbb{R}_+$, and write

$\int_{\|u\| > 1} \Big[\|x(u) + c_1(x,u)\|^q - \|x(u)\|^q\Big]\Pi(du) \le \int_{\|u\| > 1} \|c_1(x,u)\|^q\, \Pi(du) \le \mathrm{const} \cdot [\psi(x)]^q.$

Thus,

$A\phi(x) \le -q\alpha\|x\|^q + C\|x\|^q\Big\{(1 - \delta_r)^{q-2}[\delta_r]^2 + (1 - \delta_r)^{q-1}\delta_r + \delta_r^q\Big\} = \phi(x)\Big[-q\alpha + C\Big\{(1 - \delta_r)^{q-2}[\delta_r]^2 + (1 - \delta_r)^{q-1}\delta_r + \delta_r^q\Big\}\Big], \quad \|x\| > r,$

with some constant $C$. Since $\delta_r \to 0$, $r \to +\infty$, this gives R. The proposition is proved.
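The drift estimate of the proof can be checked numerically in a toy one-dimensional case. The sketch below is an illustration under assumptions that are not taken from the paper: $a(x) = -\alpha x$, additive noise $c(x,u) = u$ (so that $c_1 = c$, $c_2 = 0$ and $\psi \equiv 1$), and a finite Lévy measure supported on finitely many points with $|u| > 1$, so that no compensator appears. The helper `drift_ratio` is hypothetical; it evaluates $A\phi(x)/\phi(x)$ for $\phi(x) = |x|^q$ at a point $x$ away from the origin.

```python
def drift_ratio(x, alpha, q, jumps):
    """Return A phi(x) / phi(x) for phi(x) = |x|**q.

    Toy assumptions (not from the source): a(x) = -alpha * x, c(x, u) = u,
    and `jumps` is a list of (u, weight) pairs with |u| > 1 that describes
    a finite Levy measure, so the jump part has no compensating term.
    """
    phi = abs(x) ** q
    # q * (a(x), x) * |x|^(q-2)
    drift_part = q * (-alpha * x) * x * abs(x) ** (q - 2)
    # integral of [ |x + u|^q - |x|^q ] against the Levy measure
    jump_part = sum(w * (abs(x + u) ** q - phi) for u, w in jumps)
    return (drift_part + jump_part) / phi


ratio_far = drift_ratio(1.0e4, alpha=1.0, q=0.5, jumps=[(2.0, 1.0), (-3.0, 0.5)])
ratio_near = drift_ratio(10.0, alpha=1.0, q=0.5, jumps=[(2.0, 1.0), (-3.0, 0.5)])
```

In this toy case the ratio approaches $-q\alpha$ as $|x|$ grows, which is the behaviour that the final estimate of the proof expresses through $\delta_r \to 0$.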

4.2. Condition N. In [16], [18] conditions were given, sufficient for the set $\{S_t = \mathbb{R}^m\}$ to have probability one. Below we give a milder version of these conditions, sufficient for this set to have non-zero probability. Denote, for $x \in \mathbb{R}^m$, $\Theta_x = \{u \mid I_{\mathbb{R}^m} + \nabla_x c(x,u) \text{ is invertible}\}$, and put

$\tilde\Delta(x,u) = [I_{\mathbb{R}^m} + \nabla_x c(x,u)]^{-1}\Delta(x,u), \quad u \in \Theta_x.$

Denote by $S^m = \{v \in \mathbb{R}^m \mid \|v\|_{\mathbb{R}^m} = 1\}$ the unit sphere in $\mathbb{R}^m$.

Proposition 4.2. Suppose that there exists $x_* \in \mathbb{R}^m$ such that

(4.2) $\forall \varepsilon > 0,\ v \in S^m: \quad \Pi\big(u \in \Theta_{x_*}:\ (\tilde\Delta(x_*,u), v)_{\mathbb{R}^m} \ne 0,\ \|c(x_*,u)\| < \varepsilon\big) > 0.$

Then condition N holds true with this $x_*$ and arbitrary $t_* > 0$.

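Proof. We need to prove that, on some $\Omega_0$ with $P_{x_*}(\Omega_0) > 0$,

(4.3) $\mathrm{span}\{\mathcal{E}_\tau^t \Delta(X(\tau-), p(\tau)),\ \tau \in \mathcal{D} \cap (0,t)\} = \mathbb{R}^m.$

Below, the set $\Omega_0$ will be constructed explicitly, and in particular, on the set $\Omega_0$, the matrix $\mathcal{E}_0^t$ will be non-degenerate (and therefore, for any $\theta, \tau$ with $0 \le \theta \le \tau \le t$, the matrix $\mathcal{E}_\theta^\tau$ also will be non-degenerate). Then, on this set,

$\mathrm{span}\{\mathcal{E}_\tau^t \Delta(X(\tau-), p(\tau)),\ \tau \in \mathcal{D} \cap (0,t)\} = \mathcal{E}_0^t\, \mathrm{span}\{[\mathcal{E}_0^\tau]^{-1}\Delta(X(\tau-), p(\tau)),\ \tau \in \mathcal{D} \cap (0,t)\},$

and (4.3) is equivalent to

(4.4) $\mathrm{span}\{[\mathcal{E}_0^\tau]^{-1}\Delta(X(\tau-), p(\tau)),\ \tau \in \mathcal{D} \cap (0,t)\} = \mathbb{R}^m.$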

For n > 1, consider the set D n = {t 6 D, ||p( r )|| > }■ This set is a.s. locally finite, and therefore can be 
enumerated increasingly, T> n = {rf, t£, . . . }. Denote, for fc < m, S£ = {[fij""]-^^^-),^?)), j < fc}. 
By the construction, $S_k^n$ is a linear span of a finite family of vectors. Let us consider the $k$-th vector from this family,

k{X{T%-),p{T%)). 

One can construct the measurable map V : (R m ) k ~ 1 — > S m such that 

Vj = 1, . . . , k - 1 V(x lt . . . ,x k -i) ± Xj, n,...,iner. 

We will write v^_ x = M£ J J" 1 ] • F({[£</ J^Apf (t"-),p(t™)), j < fc}), where (M)* denotes the adjoint 

matrix for M. The random vector is well defined on the set {£ r *~ is invertible} G 3>,.„_, and is 

3> fc „ _ -measurable. 

The value p(r^) is independent of 9> fc „_, and its distribution is equal to j-n(- n {||u|| > 1}), where 
X n = n(||u| > 1). Therefore, on the set {£ T ^~ is invertible}, 



P([£?" ]- l k{X{TZ-),p(T%)) is well defined and £ S^J^J = 



(4.5) 

= ^-n(u g e 

An ^ 



(A(jC,u),v) R m ^()) 



> 



x=X (r»-),«=^«_ 1 A„ t-Gi 

For a given e > 0, consider, for n > 1, the maps 



- inf nfwee, (A(x,«),«) R m ^o) 



x=X(r?-) 



m G 6 X „| (A(ar«,ti),u) K m ^ 0, ||c(a;*,ti)|| < e, > 

Since functions c, V^c, A are continuous w.r.t. x on their domain, every /„ is lower semicontinuous. For every 
v G S m , f n (v) monotonously tends to a positive limit as n | oo. Therefore, due to Dini theorem, there exists 
n = n(e) £ N such that 

1- 



inf u(u G 9 2 

»ES m V 



(A(x»,u),w) R ™ ^ 0, ||c(x»,u)|| < e, > 



> 0. 



Analogously, the function 



3n inf nfuee, 

tiES m V 



(A(x,u),v) R m ^0, ||c(x,u)|| < e,||u|| > -L) 
is lower semicontinuous, and thus there exists 5 = 5(e) > such that 

(4.6) inf n(ueB x {A{x,u),v) Rm ,6 0, ||c(s,u)|| < e, ||u|| > -^t) > 0. 

Define iteratively ei, . . . , e m in the following way. Take e m > arbitrary, and put 



e k -i = mm 



1 x( ^ 1 



k = 2, . . . , m. 



By the construction, <5(£&) > 3^/<fe £i,k = 2, . . . ,m. Put n = max;<fc n(e;). Let us show that, for these n G N, 
£i, . . . , e m > 0, and properly chosen r G (0, — ), the set 



o = [n^ni] n 



has non-zero probability, where 



fl r = |£** is invertible |, 

n r k = {V n n((k-l)r,kr} = {r r k }, ££_ 1)r is invertible, ||X(r fe "-)-X((fc-l)r)|| < e k - U \\X(kr)-X(^)\\ < e k , 

\\X(t2)-X(tZ-)\\ < e k and [^H"^^^-),^) ^n{[^-]^A(X{^-), P (r-)),j < k}}. 
It is easy to verify that P[fi r |3 r mr ] > a.s. (the proof is omitted), so we need to verify that P(n^ =1 fi^) > 0. 
The process $X$ can be described in the following way: at the moments $\tau_k^n$, $k \ge 1$, it has jumps of the value $c(X(\tau_k^n-), p(\tau_k^n))$, and on every interval of the type $[\tau_{k-1}^n, \tau_k^n)$, $k \ge 1$, it moves due to the SDE



(4.7) dX n (t) = a n {X n {t))dt + c{X n {t~),u)v{dt,du), 

J\M\<-k 

where a n (x) = o(x) + i||„|| e [j. i] c(x , u)U(du) , t^ 1 = 0. Denote, by X n (x, •), the solution to (|4~7|) with X n (0) = x 
and, by £?' x '', the correspondent stochastic exponent (both X™(a;, •) and £. n ' x are independent of the point 
process p| d™)- Then, for a given n G N, £i, . . . , s m > 0, one can choose r G (0, |r ) small enough for 

D = inf PfVs < r \\X n (x, s) - x\\ < mine,, £^' x ' s is invertible) > 0. 

Now, let us estimate P(fi£| n^ fc f2[). The set n^- fe fi[ belongs to 3vfe_i) r , and, on this set, 

||X((fc-l)r)-x*|| <3 ^ ej + 2e fc _i. 

Z<fc-1 

Now we can take subsequently conditional expectations first w.r.t. 3>»_ V (t(p(t£)), then w.r.t. 3>»_ (on this 
step, we use (|4.5p ). and, at last, w.r.t. 37fc_i) r , and write that, on this set, 



p(fi£ n {\\X((t{}-) -x4< 3$>, \\X(tf) - X((r fe "-)|| < e h , \\X((kr) - X((rj?)|| < e k }\? (k . 1)r ) > 



> D ■ (rA„e~ rA ») • -L • D = r 7 D 2 e" 
where 

7 = min inf Ilfu G Q x (A(x, u), u) R m ^ 0, |[c(x, u)|| < s k , \\u\\ > -) > 

l<k x£B(x*,3J2i <k ) ^ n ' 

by the construction. Therefore, 

P(n£Lx^) > 0, 

that gives the needed statement. The proposition is proved. 

Let us also give a sufficient condition for N, in which the conditions on the coefficients $a, c$ and on the Lévy measure of the noise are separated.

For $w \in S^d$ (recall that $S^d$ denotes the unit sphere in $\mathbb{R}^d$) and $\varrho \in (0,1)$, denote by $V_+(w,\varrho) = \{y \in \mathbb{R}^d \mid (y,w)_{\mathbb{R}^d} \ge \varrho\|y\|_{\mathbb{R}^d}\}$ the one-sided cone with the axis $\langle w \rangle = \{tw, t \in \mathbb{R}\}$, and by $V(w,\varrho) = \{y \in \mathbb{R}^d \mid |(y,w)_{\mathbb{R}^d}| \ge \varrho\|y\|_{\mathbb{R}^d}\}$ the two-sided cone with the same axis.

Proposition 4.3. Suppose that the following two conditions hold true.

1. For every $w \in S^d$, there exists $\varrho \in (0,1)$ such that, for every $\delta > 0$,

$\Pi(V(w,\varrho) \cap \{\|u\| \le \delta\}) > 0.$

2. For some point $x_*$, there exists its neighborhood $O_{x_*}$ such that

2a. $c(x,u) = \chi(x)u + \delta(x,u)$, $x \in O_{x_*}$, and

$\|\delta(x_*,u)\| + \|\nabla_x\delta(x_*,u)\| = o(\|u\|), \quad \|u\| \to 0;$

2b. the functions $\chi(\cdot)$ and $a(\cdot)$ belong to $C^1(O_{x_*}, \mathbb{R}^{m \times d})$ and $C^1(O_{x_*}, \mathbb{R}^m)$ correspondingly, and satisfy the following joint non-degeneracy condition:

$\mathrm{rank}\big[\nabla a(x_*)\chi(x_*) - \nabla\chi(x_*)a(x_*)\big] = m.$

Then condition N holds true with this $x_*$ and arbitrary $t_* > 0$.

Remark. Condition 2b is formulated in the case A. In the case B, it should be replaced by the condition $\det\nabla a(x_*) \ne 0$, and, in this case, condition 2a trivially holds true with $\chi(x) = I_{\mathbb{R}^m}$.
Proof. We use Proposition 4.2. Denote $\nabla a(x_*)\chi(x_*) - \nabla\chi(x_*)a(x_*) = \Lambda$. It follows from condition 2 and the explicit formula for $\tilde\Delta$ that

(4.8) $\tilde\Delta(x_*,u) = \Lambda u + o(\|u\|), \quad \|u\| \to 0.$

Let $v \in S^m$, $\varepsilon > 0$ be fixed. Consider the linear subspace $L_v = \{u \in \mathbb{R}^d \mid \Lambda u \perp v\} = (\Lambda^*\langle v \rangle)^\perp$ ($\Lambda^*$ is the adjoint matrix for $\Lambda$). This subspace is proper, due to the condition $\mathrm{rank}\,\Lambda = m$. Take $w \in S^d$ such that $w \perp L_v$, then, for any $\varrho \in (0,1)$, $L_v \cap V(w,\varrho) = \{0\}$, and, therefore, there exists $c = c(v,\varrho) > 0$ such that

$|(\Lambda u, v)_{\mathbb{R}^m}| \ge c\|u\|, \quad u \in V(w,\varrho).$

This, together with (4.8), provides that

(4.9) $|(\tilde\Delta(x_*,u), v)_{\mathbb{R}^m}| \ge c\|u\| + o(\|u\|), \quad u \in V(w,\varrho),\ \|u\| \to 0.$

Take $\varrho$ from condition 1 of the Proposition, and $\delta_* = \varepsilon \cdot [\psi_*(x_*)]^{-1}$ ($\psi_*$ is given in the condition (1.1)). Then, for every $\delta \in (0,\delta_*)$, the measure $\Pi$ of the set $V(w,\varrho) \cap \{\|u\| \le \delta\}$ is positive, and, on this set, $\|c(x_*,u)\| < \varepsilon$. On the other hand, (4.9) implies that, for $\delta$ small enough,

$(\tilde\Delta(x_*,u), v)_{\mathbb{R}^m} \ne 0$ on the set $V(w,\varrho) \cap \{\|u\| \le \delta\}$.

The proposition is proved.

4.3. Condition S. One possible way to provide that condition S holds true is to use general support theorems for the distribution of the solution to an SDE with a jump noise. For instance, Theorem I [25] provides, in the case A, the following result.

Proposition 4.4. Consider $U$, the set of sequences $\{(t_n,u_n), n \ge 1\}$, where $\{t_n\} \subset \mathbb{R}_+$ is a strictly increasing sequence with $\lim t_n = +\infty$, and $\{u_n\} \subset \mathrm{supp}\,\Pi$ is arbitrary. Suppose that, for any given $R, T \in \mathbb{R}_+$, for every $x$ with $\|x\| \le R$ and $\varepsilon > 0$ there exists a sequence $\{(t_n,u_n)\} \in U$ such that the solution to the equation

$Z(t) = x + \int_0^t a(Z(s))\,ds + \sum_{t_n \le t} c(Z(t_n-), u_n), \quad t \in \mathbb{R}_+,$

satisfies the condition $\|Z(T) - x_*\| < \varepsilon$.
Then condition S holds true.
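The deterministic equation of Proposition 4.4 is easy to explore numerically. The following is a minimal sketch for the scalar case, assuming an explicit Euler scheme with a fixed step; the function name `skeleton_path` and its parameters are hypothetical and serve only to illustrate how a chosen sequence $\{(t_n,u_n)\}$ steers $Z(T)$.

```python
def skeleton_path(x, a, c, jumps, T, dt=1e-4):
    """Euler approximation of Z(T) for the scalar equation
    Z(t) = x + int_0^t a(Z(s)) ds + sum_{t_n <= t} c(Z(t_n-), u_n).

    a, c  : coefficient functions (scalar case only, an assumption)
    jumps : list of (t_n, u_n) pairs, the deterministic "control"
    dt    : time step of the Euler scheme (an assumption of this sketch)
    """
    pending = sorted(jumps)
    z = x
    k = 0
    n_steps = int(round(T / dt))
    for step in range(1, n_steps + 1):
        z += a(z) * dt                      # drift over one step
        t = step * dt
        # apply every jump whose time has been reached, using the
        # pre-jump value Z(t_n-) as the argument of c
        while k < len(pending) and pending[k][0] <= t + 1e-12:
            z += c(z, pending[k][1])
            k += 1
    return z


# Linear drift a(z) = -z and additive jumps c(z, u) = u.
z_no_jump = skeleton_path(1.0, lambda z: -z, lambda z, u: u, [], 1.0)
z_one_jump = skeleton_path(1.0, lambda z: -z, lambda z, u: u, [(0.5, 2.0)], 1.0)
```

For this linear example the exact values are $e^{-1}$ without jumps and $e^{-1} + 2e^{-1/2}$ with the single jump, so the scheme can be compared against a closed form.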

Another possibility is to give some straightforward conditions that seem to be more suitable in certain concrete cases. Let us formulate, without a detailed proof, one condition of such a type. Note that, unlike the previous Proposition, the next one does not require a moment restriction on the Lévy measure of the noise, and is formulated for both cases A and B simultaneously. Denote, for any $x \in \mathbb{R}^m$, $\Pi_x(\cdot) = \Pi(u \in \mathbb{R}^d \mid c(x,u) \in \cdot)$.

Proposition 4.5. Suppose that, for every $x \in \mathbb{R}^m$, $v \in S^m$, there exists $\varrho \in (0,1)$ such that, for any $\delta > 0$,

(4.10) $\Pi_x(V_+(v,\varrho) \cap \{\|y\| \le \delta\}) > 0.$

Then $y \in \mathrm{supp}\,P_x^t$ for every $x,y \in \mathbb{R}^m$, $t > 0$, and therefore condition S holds true.

Sketch of the proof. Take $v = \frac{y - x}{\|y - x\|} \in S^m$, $\varrho \in (0,1)$ from the condition (4.10) for the given $x$ and $v$, and $\delta_* = \varrho\|y - x\|$. Then there exist $\delta_1, \delta_2, \gamma > 0$ such that $\delta_1 < \delta_2 < \delta_*$,

$\Pi_x(V_+(v,\varrho) \cap \{\|y\| \in [\delta_1,\delta_2]\}) > 0,$

and

$\|(x + c) - y\| < \|x - y\| - \gamma, \quad c \in V_+(v,\varrho) \cap \{\|y\| \in [\delta_1,\delta_2]\}.$

Then arguments analogous to those given in the proof of Proposition 4.2 allow one to conclude that, for any two points $x \ne y$, there exist $\gamma > 0$ and $t > 0$ such that, for any $s \in (0,t)$,

(4.11) $P_x(\|X(s) - y\| < \|x - y\| - \gamma) > 0.$

Let $\varepsilon \in (0, \|x - y\|)$ be given, then, since the process $X$ is Feller, one can conclude from (4.11) that there exist $t_\varepsilon > 0$, $\gamma_\varepsilon > 0$ such that, for any $t < t_\varepsilon$,

$p_t = \inf_{z:\ \|z - y\| \in [\varepsilon, \|x - y\|]} P_z(\|X(t) - y\| < \|z - y\| - \gamma_\varepsilon) > 0.$

This implies that, for any $x \ne y$ and $\varepsilon > 0$, for any $t < t_\varepsilon$,

$P_x(X(t) \in B(y,\varepsilon)) \ge [p_{t/N}]^N > 0$, where $N = \Big[\frac{\|x - y\|}{\gamma_\varepsilon}\Big] + 1$.

Via the Markov property of the process $X$, this implies the statement of the Proposition.
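The geometric step of the sketch (a jump inside the cone $V_+(v,\varrho)$ moves the point closer to $y$) can be checked directly in the plane. The code below is only an illustration: the helper names are hypothetical, and the threshold $\|c\| < 2\varrho\|y - x\|$ is the one obtained from expanding $\|x + c - y\|^2 = \|x - y\|^2 - 2(c, y - x) + \|c\|^2$, not a constant quoted from the source.

```python
import math


def in_cone(c, v, rho):
    """True if c lies in V_+(v, rho): (c, v) >= rho * |c|, v a unit vector."""
    return c[0] * v[0] + c[1] * v[1] >= rho * math.hypot(c[0], c[1])


def step_gain(x, y, c):
    """Decrease of the distance to y after the jump x -> x + c."""
    moved = (x[0] + c[0], x[1] + c[1])
    return math.dist(x, y) - math.dist(moved, y)


x, y = (0.0, 0.0), (3.0, 4.0)
dist_xy = math.dist(x, y)                      # equals 5
v = ((y[0] - x[0]) / dist_xy, (y[1] - x[1]) / dist_xy)
rho = 0.5
base_angle = math.atan2(v[1], v[0])

# Jumps of length r < 2 * rho * |y - x| within 59 degrees of the axis v,
# i.e. strictly inside the cone with rho = cos(60 degrees).
gains = []
for r in (0.5, 1.5, 2.5, 3.5, 4.5):
    for deg in range(-59, 60, 1):
        ang = base_angle + math.radians(deg)
        c = (r * math.cos(ang), r * math.sin(ang))
        gains.append((in_cone(c, v, rho), step_gain(x, y, c)))
```

Every sampled jump lies in the cone and strictly decreases the distance to $y$, which is the property iterated $N$ times in the sketch of the proof.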

4.4. One-dimensional case. Proof of Proposition 0.1. In the case $m = 1$, the sufficient conditions given in the previous subsections can be made more precise. For instance, the following version of Proposition 4.2 holds true.

Proposition 4.6. Let $m = 1$ and suppose that there exists $x_* \in \mathbb{R}$ such that

(4.12) $\Pi(u \in \Theta_{x_*}:\ \tilde\Delta(x_*,u) \ne 0) > 0.$

Then condition N holds true for any $t_* > 0$.

We omit the proof, since it is totally analogous to the one of Proposition 4.2, except for one point that causes the difference between conditions (4.2) and (4.12). For $m = 1$, we have to apply estimate (4.5) only once, for the jump moment $\tau_1^n$. This means that we do not have to control the position of the process after the jump at this moment, and thus, when $m = 1$, the limitation involving $\varepsilon$ can be removed from (4.2).

Now, let us prove Proposition 0.1 formulated in the Introduction. Condition R is provided by Proposition 4.1. Let us proceed with the conditions N and S. Since $\Pi(\mathbb{R} \setminus \{0\}) > 0$, either $\Pi((-\infty,0))$ or $\Pi((0,\infty))$ is non-zero. Let, for instance, $\Pi((0,\infty)) > 0$. Take $R$ large enough for $\sup_{x \ge R} \frac{a(x)}{x} < 0$, then $y \in \mathrm{supp}\,P_x^t$ for any $y > R$ and any $x \in \mathbb{R}$, $t > 0$. This follows from Theorem I [25] in the case $\int_{\mathbb{R}} |u|\,\Pi(du) < +\infty$, and from Theorem 3 [26] in the case $\int_{\mathbb{R}} |u|\,\Pi(du) = +\infty$. This provides that S holds true with arbitrary $t > 0$ and $x_* > R$. In order to provide N for some $x_* > R$, let us use Proposition 4.6. In the case of additive noise, $\Delta(x,u) = a(x + u) - a(x)$. Therefore, if $\Pi(\mathbb{R} \setminus \{0\}) > 0$ and (4.12) fails for every $x_* > R$, then there exists a sequence $\{x_n\}$ with $|x_n| \to +\infty$ such that $a(x_{n+1}) = a(x_n)$. This, however, contradicts the condition $\limsup_{|x| \to +\infty} \frac{a(x)}{x} < 0$. The proposition is proved.

5. Counterexamples 

We have seen that the three basic conditions R, N, S imply the exponential estimates (1.3), (1.4). In this section we give counterexamples that show that, as soon as any of these conditions is removed, the solution to (0.1) may fail to be ergodic (i.e., to possess a unique invariant distribution $\mu_{inv} \in \mathcal{P}$).

The cases, when conditions R or S are missed, are quite standard and simple, thus we just outline the 
corresponding examples. 

Example 5.1. Let $m = d = 1$, $c(x,u) = u$, $\Pi = 2\delta_1 + \delta_{-1}$ and $a(x) \in C^1(\mathbb{R})$ be such that $a(x) = -c$, $|x| > 1$, with $c \in (0,1)$. Both conditions N and S hold true here (this can be provided by the arguments from subsection 4.4), but R fails. The law of large numbers provides that, for every $x > 1$,

$P_x\big(\lim_{t \to +\infty} X(t) = +\infty,\ \inf_t X(t) > 1\big) > 0.$

This implies that $X$ does not have any invariant probability measure.

Example 5.2. Let $m = d = 1$, $a(x) \in C^1(\mathbb{R})$ be such that $a(x) = -x$, $|x| > 2$ and $a(x) = 0$, $|x| \le 1$. Let also $\Pi = \delta_1$ and $c(\cdot,1) \in C^1(\mathbb{R})$ be bounded and such that $c(x,1) = \mathrm{sign}\,x$, $|x| > 2$ and $x \cdot c(x,1) \ge 0$, $x \in \mathbb{R}$. Then condition R holds true, and N holds true for any $x_*$ with $|x_*| > 2$ (Proposition 4.6). Condition S fails: starting from either of the sets $A_+ = [1,+\infty)$ or $A_- = (-\infty,-1]$, the process $X$ remains in this set with probability 1. Therefore, there exist at least two different invariant measures for $X$, supported by these sets.
The last example is less trivial; it concerns the case where R and S hold true while N does not.

Example 5.3. Let us start with an auxiliary construction. Consider the unit circle $C=\frac{1}{2\pi}S^2$ on the plane
$\mathbb{R}^2$, and define the discrete time Markov process $Z$ on $C$ by its transition probability

\[Q(z,\cdot)=(1-3p)\,\delta_{3z}(\cdot)+p\Big[\delta_{\frac{z}{3}}(\cdot)+\delta_{\frac{z+1}{3}}(\cdot)+\delta_{\frac{z+2}{3}}(\cdot)\Big],\qquad z\in C,\]

where $p\in(0,\frac{1}{6})$ is given, and every arithmetic operation on $C$ is defined as the same operation on $[0,1)=C$
modulo 1. If $Z_0=z$ is any point from $[0,1)=C$, then there is a non-zero probability for $Z_n$ to be equal
to each point of the type

\[3^{-n}z+\sum_{k=1}^{n}a_k3^{-k},\qquad a_k\in\{0,1,2\},\ k=1,\dots,n,\]

and therefore the set $\bigcup_{n\in\mathbb{N}}\operatorname{supp}P(Z_n\in\cdot\,|\,Z_0=z)$ is dense in $C$, i.e. the process $Z$ is topologically irreducible.
Now let us show that Z possesses at least two different invariant measures (in fact, the set of invariant measures 
here is much larger). 
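
The density claim for the points $3^{-n}z+\sum_{k\le n}a_k3^{-k}$ can be checked numerically. The following is only an illustrative sketch: the function names `contracting_images` and `largest_gap` are hypothetical, and the starting point $z=1/5$ is an arbitrary choice. It enumerates the points with exact rational arithmetic and measures the largest arc they leave uncovered on $[0,1)$ taken modulo 1.

```python
from fractions import Fraction
from itertools import product


def contracting_images(z, n):
    """Points 3^{-n} z + sum_{k=1}^n a_k 3^{-k}, a_k in {0, 1, 2}, as exact fractions."""
    points = set()
    for digits in product(range(3), repeat=n):
        offset = sum(Fraction(a, 3 ** k) for k, a in enumerate(digits, start=1))
        points.add(Fraction(z) / 3 ** n + offset)
    return sorted(points)


def largest_gap(points):
    """Largest arc between neighbouring points of a sorted list on [0, 1) modulo 1."""
    wrapped = points + [points[0] + 1]  # close the circle
    return max(b - a for a, b in zip(wrapped, wrapped[1:]))


z = Fraction(1, 5)  # arbitrary starting point, chosen only for illustration
for n in range(1, 6):
    print(n, largest_gap(contracting_images(z, n)))
```

The largest gap shrinks like $3^{-n}$, so the union over $n$ of the reachable sets leaves no arc of $C$ uncovered.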

Consider, together with $Z$, the sequence $T_n$ defined by

\[T_0=0,\qquad T_{n+1}=\begin{cases}(T_n-1)\vee 0, & Z_{n+1}=3Z_n,\\ T_n+1, & \text{otherwise},\end{cases}\qquad n\ge 0.\]

Then $\{T_n\}$ is a birth-and-death Markov chain with probabilities of birth equal to $b_k=b=3p$ and probabilities
of death equal to $d_k=d=1-3p$. We have $d>b$ since $p<\frac{1}{6}$, and therefore this chain is ergodic, which
means that for any given $\varepsilon>0$ there exists $L_\varepsilon\in\mathbb{N}$ such that

\[\sup_{n\ge 0}P(T_n>L_\varepsilon)<\varepsilon.\tag{5.1}\]
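
To make the uniform tail bound (5.1) tangible, the sketch below propagates the exact law of a birth-and-death chain with birth probability $3p$ and death probability $1-3p$, reflected at 0 and started at 0, and compares $P(T_n>L)$ with the geometric tail $(b/d)^{L+1}$ of its stationary law. The function name `tail_probabilities` and the values $p=1/10$, $L=5$ are assumptions made for illustration only; the comparison relies on the standard fact that a stochastically monotone chain started at its minimal state is dominated by its stationary distribution.

```python
from fractions import Fraction


def tail_probabilities(p, level, steps):
    """Exact values of P(T_n > level), n = 1..steps, for the chain started at T_0 = 0."""
    birth, death = 3 * p, 1 - 3 * p
    dist = [Fraction(0)] * (steps + 2)  # the chain cannot exceed `steps` in `steps` moves
    dist[0] = Fraction(1)
    tails = []
    for _ in range(steps):
        new = [Fraction(0)] * (steps + 2)
        for k, mass in enumerate(dist):
            if mass:
                new[max(k - 1, 0)] += death * mass  # death step, reflected at 0
                new[k + 1] += birth * mass          # birth step
        dist = new
        tails.append(sum(dist[level + 1:]))
    return tails


p = Fraction(1, 10)                 # illustrative value in (0, 1/6)
ratio = (3 * p) / (1 - 3 * p)       # b / d < 1
tails = tail_probabilities(p, level=5, steps=60)
print(float(max(tails)), float(ratio ** 6))
```

In this sketch, choosing $L_\varepsilon$ with $(b/d)^{L_\varepsilon+1}<\varepsilon$ is one explicit way to satisfy (5.1).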

Take $Z_0=0$ and consider some weak limit point $\mu^0$ for the sequence of Khasminskii's averages

\[\frac{1}{N}\sum_{n<N}P(Z_n\in\cdot\,|\,Z_0=0)\]

(see the proof of Corollary 3.4). By the construction, any digit in the 3-adic representation for $Z_n$, with the
number of the digit greater than $T_n$, is equal to 0. This means that $\mu^0(A^0_\varepsilon)\ge 1-\varepsilon$, where

\[A^0_\varepsilon=\{z\in[0,1)\ |\ \text{all 3-adic digits for } z \text{, with the number of the digit} > L_\varepsilon \text{, are equal to } 0\}\]

(the inequality holds true since the set $A^0_\varepsilon$ is closed). Therefore, $\mu^0(A^0)=1$, where

\[A^0=\{z\in[0,1)\ |\ \text{all 3-adic digits for } z \text{, except some finite number of the digits, are equal to } 0\}.\]

Analogously, if $Z_0=\frac{1}{2}$, and $\mu^1$ is any weak limit point for the sequence of Khasminskii's averages

\[\frac{1}{N}\sum_{n<N}P\big(Z_n\in\cdot\,\big|\,Z_0=\tfrac{1}{2}\big),\]

then $\mu^1(A^1)=1$, where

\[A^1=\{x\in[0,1)\ |\ \text{all 3-adic digits for } x \text{, except some finite number of the digits, are equal to } 1\},\]

and $A^0\cap A^1=\emptyset$. This means that $\mu^0$ and $\mu^1$ are mutually singular invariant measures for $Z$.
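
The digit bookkeeping behind this argument can be sketched in code. The sketch assumes a representation that is not in the original text: a point is stored as a finite list of leading 3-adic digits followed by a constant tail (tail digit 0 for $Z_0=0$, tail digit 1 for $Z_0=\frac{1}{2}$), and the counter $T_n$ is updated according to which move was drawn. The names `step` and `simulate` are hypothetical. Tripling modulo 1 drops the first digit, and $z\mapsto(z+a)/3$ prepends the digit $a$, so neither move ever touches the tail.

```python
import random


def step(prefix, move):
    """Apply one move of Z to the finite digit prefix; the constant tail is untouched."""
    if move == "triple":        # z -> 3z mod 1 drops the first 3-adic digit
        return prefix[1:]
    return [move] + prefix      # z -> (z + a)/3 prepends the digit a


def simulate(p, steps, seed):
    """Return the pairs (prefix length, T_n) along one simulated path."""
    rng = random.Random(seed)
    prefix, t = [], 0
    history = []
    for _ in range(steps):
        if rng.random() < 1 - 3 * p:
            move = "triple"
            t = max(t - 1, 0)           # "death" step of T
        else:
            move = rng.randrange(3)     # each digit a in {0, 1, 2} has probability p
            t += 1                      # "birth" step of T
        prefix = step(prefix, move)
        history.append((len(prefix), t))
    return history


history = simulate(p=0.1, steps=1000, seed=0)
print(max(t for _, t in history))
```

Along every path the prefix length equals the move-based counter, so all digits with number greater than $T_n$ coincide with the tail digit of the starting point, whichever of the two starting points is used.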

Now, let us proceed with the construction of the process. Put $m=2$, $d=2$ and $c(x,u)=c_1(x,u_1)+
c_2(x,u_2)$, $x\in\mathbb{R}^2$, $u=(u_1,u_2)\in\mathbb{R}^2$. Let $\Pi=\Pi_1\times\Pi_2$ with $\Pi_1=\delta_1$, $\Pi_2=(1-3p)\delta_1+p(\delta_2+\delta_3+\delta_4)$. Let the
part $c_1$ give the radial part of the jump noise:

\m\ 
where $b\in C^1(\mathbb{R}^2,\mathbb{R})$ is such that $b(x)=0$ for $\|x\|\le 1$, $b(x)>0$ for $\|x\|\in(1,2)$ and $b(x)=1$ for $\|x\|\ge 2$. For
$\|x\|>1$, let the part $c_2$ define the "rotational" part of the noise: if $x$ is written in the polar coordinates as
$(r,\theta)$, then $x+c_2(x,i)$, $i=1,\dots,4$ has the following polar representation:



For $\|x\|\le 1$, let the functions $c_2(\cdot,i)$, $i=1,\dots,4$ be defined in an arbitrary way, such that $c_2(\cdot,i)\in
C^1(\mathbb{R}^2,\mathbb{R}^2)$, $\|x+c_2(x,i)\|\ge 1$, $x\in\mathbb{R}^2$, $i=1,\dots,4$. Let the drift coefficient be equal to $a(x)=-b(x)\cdot x$, $x\in\mathbb{R}^m$.
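
The display with the polar representation of $x+c_2(x,i)$ is not legible in this copy. One choice that would be consistent with the weights of $\Pi_2$ and with the kernel $Q$ of the chain $Z$ (under the identification $\theta=2\pi z$) is sketched below; it is an assumption inferred from the surrounding text, not a transcription of the original formula.

```latex
\[
x+c_2(x,1)\ \longleftrightarrow\ \bigl(r,\ 3\theta \bmod 2\pi\bigr),
\qquad
x+c_2(x,i)\ \longleftrightarrow\ \Bigl(r,\ \tfrac{\theta+2\pi(i-2)}{3}\Bigr),\quad i=2,3,4 .
\]
```

With this choice the atom of $\Pi_2$ at 1 (mass $1-3p$) triples the angle, and the atoms at 2, 3, 4 (mass $p$ each) realise the three maps $z\mapsto(z+a)/3$, $a\in\{0,1,2\}$, while the radius $r$ is left unchanged.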

By the construction, condition R holds true (Proposition 4.1) and condition S holds true for every $x_*$ with
$\|x_*\|>1$ and every $t>0$ (Theorem I [25]). Let us show that, however, there exist two different invariant
measures for X. If $X(0)=x$ is such that $\|x\|>1$, then the processes $R(\cdot)=\|X(\cdot)\|$ and $\Theta(\cdot)=\frac{X(\cdot)}{\|X(\cdot)\|}$
are independent (w.r.t. $P_x$) Markov processes. The first process possesses at least one invariant measure $\kappa$,
supported by $[1,+\infty)$ (see Corollary 3.4). The second one is the pure jump Markov process with the total
intensity of the jump equal, at every point, to $(1-3p)+p+p+p=1$. Its embedded Markov chain coincides,
up to the scaling parameter $2\pi$, with the chain $Z$ considered before. Therefore, this process possesses at least
two different invariant measures $\chi_1,\chi_2$ on $S^2$. Thus, the process $X(\cdot)$ possesses at least two different invariant
measures $\mu_1=\kappa\times\chi_1$, $\mu_2=\kappa\times\chi_2$, supported by $[1,+\infty)\times S^2=\{x:\|x\|\ge 1\}$.

This example shows that the topological irreducibility condition S, together with the recurrence condition
R, is not strong enough to produce ergodicity of the solution to an SDE with a jump noise. In order to produce
ergodicity, some kind of "smoothing" condition, like the non-degeneracy condition N in our setting, is needed
additionally.